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Preface 


This book deals with the field of statistics and its use in the mak¬ 
ing of decisions. It represents an attempt to present a college text¬ 
book on elementary statistics from what may be called a behav- 
ioralistic point of view. 1 In this respect and in several others as well 
we feel that this book differs from any other introductory text in the 
field. 

The behavioralistic outlook regards statistics as a medium for 
deciding upon an alternative course of action. Thus, the emphasis 
in this book is on how to formulate a decision problem statistically 
so that data, when collected, will serve as a basis for deciding upon 
a rational course of action. The collection of data is, of course, never 
an end in itself at least it never should be an end in itself. In our 
opinion, this is the most important idea that an elementary course 
in statistics should convey to a student. Hence, we have attempted 
to make the relationship of data to decisions a central theme in the 
organization and presentation of material. Stress is laid not on the 
mere details of gathering statistical information, but on how such 
information is to be used once it is collected. 

In a w@rd, then, this book emphasizes planning —planning statis¬ 
tical studies so that when conducted they provide a basis for deci¬ 
sion. With this principle of planning as a foundation stone, other 
important aims of our text include the following: 

a) Developing in the student a skeptical attitude toward numerical 
data. 

b) Impiessing upon the student the necessity for looking behind 
figures to the procedure used to derive them. 

c ) Introducing a way of thinking, not only in terms of numbers, but 
in terms of uncertainty and risk. 

d) Indicating to the student that statistics is of ever increasing impor¬ 
tance in the field of business by presenting some of the many applications 
of business statistics. 


In so far as possible, material is presented to stress ideas and 
concepts and principles, rather than computations. Furthermore, 

The term behavioralistic” is borrowed, with essentially the same meaning from 
Leonard J. Savage, The Foundations of Statistics (New York: John Wiley &’Sons 
Inc., 1954). * 
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the approach is essentially nonmathematical. Techniques are treated 
only when they further statistical reasoning, and not as ends in 
themselves. The goal in this connection is to first bring the student 
to the point where he knows what statistics can do. Only where it 
does not detract from this objective is he taught the fine details of 
“how to do it.” As an aid in achieving these goals we include more 
than 250 examples of decision problems which can be solved through 
the use of statistics. These examples are an integral part of the text, 
and the student must be encouraged not only to read them, but to 
analyze them thoroughly. Our belief is that statistical concepts are 
best clarified by specific illustrations and that these examples moti¬ 
vate as well as instruct. 

With one or two minor exceptions only large sample statistical 
theory is treated throughout this text. For those who wish to illus¬ 
trate the thought that other theoretical distributions are useful, 
Appendix E very briefly presents t and F . 

The book may be viewed as consisting of two parts. The first part 
is represented by Chapters 1 to 10, which may be regarded as forming 
an introduction to statistics. These ten chapters are arranged to 
simulate the steps needed to plan a study for a decision problem 
which can be treated statistically. Thus, Chapter 1 treats material 
on identifying the problem, while Chapters 2 and 3 follow with 
ideas on how to define the problem in statistical terms. Chapter 4 
focuses on alternative methods of collecting information (secondary 
sources, census, sample, etc.). Chapters 5 through 8 then center at¬ 
tention on how to formulate plans for sample studies. Chapter 9 
warns against the imperfections that one may find in data. Finally, 
Chapter 10 deals with how one collects data after all of the plans for 
a study have been completed. Thus, we have postponed the collec¬ 
tion, tabulation, and presentation of data until analysis (a priori 
analysis) of data has been treated fully. At that time the student 
should appreciate why data are being collected. 

Chapters 11 through 14 form the second part of the book. Each 
chapter covers a special topic and each is an individual unit in itself. 
Alternative teaching plans for covering the material in the book 
are suggested in the teacher’s manual. 

Each chapter ends with a summary, a series of problems and ques¬ 
tions, and a bibliography. Each summary is simply designed to 
provide a way for the student to verify whether he has absorbed the 
main ideas in the chapter. The problems are also designed to provide 
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a basis for verifying whether meaning and understanding have been 
achieved. Additional problems and questions that require more 
computation will be found in the workbook designed to accompany 
the text. The bibliography includes only references geared to the 
expected level of ability of most students, since it is very discourag¬ 
ing to a student to find references that he cannot cope with. 

The three of us, and nine of our colleagues as well, have tried and 
tested drafts of Statistics for Business Decisions for six semesters at 
both the undergraduate and graduate levels. Approximately 2,000 
students have used the text during that period. We have found their 
response gratifying. Furthermore, we have been able to make many 
adjustments and revisions in the original drafts on the basis of their 
comments. The rise in the number of students desiring to pursue the 
further study of statistics has provided additional psychic income. 

In addition to our colleagues and students many other persons 
deserve our deepest and sincerest appreciation. Among these is 
Dean Thomas L. Norton who was instrumental in the introduction 
of a basic course in statistics required of all students at the School 
of Commerce, Accounts, and Finance, and who encouraged the 
writing of this textbook. His continuing interest hastened comple¬ 
tion of the task. 

To our good friend and colleague, Professor W. Edwards Doming, 
we owe many hours of time for reading and analyzing the manu¬ 
script. His suggestions were invariably valuable, and we only wish 
that we were able to incorporate them more adequately. Only those 
who know Dr. Deming and the heavy work-load that he carries can 
fully appreciate our obligation to him. 

Dean Erwin A. Gaumnitz of the School of Commerce of the Uni¬ 
versity of Wisconsin also read the manuscript, and his many sug¬ 
gestions concerning content, arrangement, and method of presenta¬ 
tion have helped us to make the text more understandable. Professor 
Julius H. Spalding, our colleague for many years, reviewed the 
earlier chapters for clarity of presentation and content. 

Mrs. Kay Degen typed the manuscript with amazing accuracy, 
and cheerfully and efficiently performed many other arduous tasks 
in connection with the development of this text. Miss Eve L. Crosby 
assisted us in reading the galley proofs and in typing the index. 

Also of assistance were Ruth, Susan, and Alice Kurnow, and Mary 
Jane Ottman. Laura Glasser provided the basis for Example 1.16. 
Lastly, but not leastly, we must thank our wives, Joyce Kurnow, 
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Anne Glasser, and Dorothy Ottman for two years of patience and 
forebearance, as well as hours of assistance in the preparation of 
the book. 

We are indebted to Professor Sir Ronald A. Fisher, Cambridge, 
and to Messrs. Oliver and Boyd, Ltd., Edinburgh, for permission to 
reprint Table IV (Table E.l. in this book) from their book Statisti¬ 
cal Methods for Research Workers; to Professor Fisher and to 
Dr. Frank Yates, Rothamsted, and to Messrs. Oliver and Boyd, 
Ltd., for permission to reproduce a portion of Table V (the .001 
values of Table E.2. of this book) from Statistical Tables for Bio¬ 
logical , Agricultural and Medical Research . We are also indebted to 
Professor Egon Pearson and to Biometrika for permission to repro¬ 
duce from Yol. XXXIII, the .05 and .01 values of F shown in 
Table E.2. of this book. 

Ernest Kurnow 
Gerald J. Glasser 
Frederick R. Ottman 

New York, NY. 

May, 1959 
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CHAPTER • 1 


Business Problems, Decisions, 

and Statistics 


1.1 The Plague and Statistics 

Throughout this book we shall be studying the theory and the 
methods of making rational decisions based upon numerical data. 
In particular, our concern will be with many of the problems aris¬ 
ing in the modern business world. 

As a simple starting point we can illustrate this general idea with 
an occurrence of great importance in the history of the subject we 
shall be studying—an occurrence which is nearing its 300th anniver¬ 
sary. It may be considered as an early illustration of decision making 
based upon numerical data. 

Example 1.1 During the seventeenth century there were several 
outbreaks of the plague in the city of London, and apparently a large 
proportion of the city’s inhabitants died during these unfortunate 
epidemics. Citizens followed announcements of births and, more¬ 
over, of deaths with considerable interest. At that time, in fact, 
summary information on the number of christenings and burials 
and records on the causes of death began to be published regularly. 

Although rough and certainly crude, this numerical information 
was of some interest, and quite in demand by some of the public and 
by businessmen. For as one writer has interpreted history: “the 
well-to-do watched these data to note the threat of infection so as to 
leave London in good time, while the merchants used them to draw 
business forecasts for the coming seasons” 1 


Corrado Gini, “The First Steps of Statistics,” Educational Research Forum 
Proceedings (International Business Machines Corporation, 1947), pp. 37-45. (Italics 
added.) 
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Perhaps history has been a little too conveniently interpreted 
here. Even so, make note of the idea of basing decisions in everyday 
affairs and in business upon numerical information. This will be the 
topic of central importance in our studies. 

Despite this rather undistinguished beginning and the unfortu¬ 
nate circumstance of originally being associated with the plague, 
the idea of decision based upon numerical information is the basis of 
an important field of study today. This field is known as statistics. 
However, the word “statistics” represents an unfortunate name 
since it is quite ambiguous. For that reason we had better pass from 
the 1600’s to the twentieth century and attempt to define it more 
precisely. 

1.2 Statistics and Statistics 

Although we have used the term “statistics” in the preceding sec¬ 
tion, we probably are using it with a slightly different meaning from 
that to which the student is accustomed. Let’s try to clear up this 
possible difficulty. Think back to the first time that you heard or 
used that word. What did it refer to? Probably it was a very sophis¬ 
ticated way of referring to numbers, or data, or numerical informa¬ 
tion. This use of the word is perfectly all right, but it is only one of 
two meanings that we shall use. 

In this first case, the word “statistics” refers to data such as the 
number of births and deaths, or the prices of scrap steel, or the 
wages of employees. As we shall see, statistical data of this type will 
be an essential ingredient in our studies, for data represent the in¬ 
formation upon which statistical decisions are based. We should, 
in fact, regard numerical information as our raw material. 

However, the subject that embraces the theory and the methods 
for making decisions from the data is also known as statistics. There¬ 
fore, in this second use of the word we are referring to a field of 
knowledge, and not merely to the data. Formalizing our present re¬ 
marks and adding them to previous comments we can arrive at a 
more complete introductory definition of our subject—this field of 
statistics. 

Let us say, for the present, that statistics involves methods and 
theory as they are applied to numerical data or observations with 
the objective oj making rational decisions in the face of uncertainty . 

This definition has implications which we will consider briefly for 
the moment and which we will review throughout the book. Firstly, 
the word “methods” refers to the statistical tools which are used to 
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make decisions on the basis of numerical information. In the course 
of our studies you will meet a number of these tools. Secondly, the 
word “theory” refers to the foundations of statistical methods. Put 
simply, the function of this theory is to make sure that proper tools 
are being used. Thirdly, the notion of making decisions in the face 
of uncertainty refers to the primary use of the methods and theory 
that we shall be studying. 

Example 1.2 A pharmaceutical company wishes to decide on 
whether to market drug A, drug B, or neither drug. They plan to col¬ 
lect data on the effectiveness of the two drugs by an experiment on 
guinea pigs. That is, they plan to collect statistical information upon 
which to base their decision. They plan to collect statistics (in the 
first sense of the word). 

How should these data be used to make the decision? How much 
data should they collect? What is the most effective way of running 
the experiment? What action should be taken on the basis of the 
data collected? What are the chances of their making the right mar¬ 
keting decision? These are all questions the field of statistics (the 
second use of the word) can help to answer. 

Methods are the formulas or other techniques which will specifi¬ 
cally answer these questions. 

Theory is the basis for these methods—the guarantee, so to speak, 
that the techniques used are the correct ones. 

The problem will have to be solved on the basis of incomplete in¬ 
formation since there are limits to the number of guinea pigs that 
can be used and hence limits on the amount of data that can be col¬ 
lected. Incomplete information, in turn, means that there will be 
some uncertainty in the decision. 

The problem falls within the realm of our subject. 

In short, remember that we have two uses for the word “statis¬ 
tics.” The first use simply indicates numerical data, while the second 
refers to a field of study. Grammatically, you should remember that 
in the first sense, the word is a plural noun (statistics are data), 
while in the second sense, the word is a singular noun (statistics is a 
subject). Although we shall make use of the plural (for raw mate¬ 
rials) , it is the singular with which this book will primarily deal. 

1.3 Decisions and Courses of Action 

In this book, we will, for the most part, restrict our discussion of 
the applications of statistics to business decision problems. This is 
only because our interests lie in this direction. This same subject, 
statistics, is also of considerable importance in many other fields, 
and similar methods and theory are also applicable there. Some of 
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these other fields are biology, medicine, psychology, agriculture, 
archaeology, and sociology. However, since most of our attention 
will be directed to the use of statistics in business and since statistics 
is concerned with decision, we now turn to survey, in the next few 
sections, some of the basic characteristics of the decision-making 
process in the business world. Our main interest and particular em¬ 
phasis in doing so will be to see what role statistics plays in this 
process. 

Suppose that we begin with a somewhat obvious but important 
point. Everyone makes some of his own personal decisions every 
day. What to wear? Where to eat? When to do this? How to do 
that? In business this same situation exists. Management must 
make certain decisions on business problems day after day. Some 
problems are minor; some are larger scale. Some are repetitive; 
some are relatively unique. Some are easily solved by snap decisions; 
some are more complicated. 

Despite this diversity, all decision problems are characterized by 
two factors. First of all, an important element that must exist in any 
decision problem is the principle of action. By action, we mean that 
a decision problem is a problem in deciding on a course of action 
and not a matter of satisfying one’s curiosity. A problem situation 
never calls for a decision unless action can be taken. 

In this connection, a life insurance company recently gave out 
some good advice to its policyholders: 

Don’t let the unsolved problems of the day pursue you all night. As 
problems arise during the day, solve them. Make your decisions, then for¬ 
get about them. Don’t keep wrestling with problems that really aren’t 
yours to solve. Most problems can be divided into two groups: those you 
can do something about and those over which you have no control . . . 
(The Connecticut Mutual Life Insurance Company.) 

This advice is equally well suited for management in dealing with 
its business problems. Furthermore, this advice must go hand in 
hand with the use of statistics. 

Statistics is concerned with many of those problems that you can 
do something about. If there is no control over the problem situa¬ 
tion and no action can be taken, nothing can be done about the 
problem anyway. Obviously, unless action can be taken on a prob¬ 
lem, there is no need to use statistics to help arrive at a solution for 
it. As you will see, if you haven’t already, we would only be wasting 
time, effort, and money. 

Statistics is useful in a large class of problems upon which man- 
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agement must make decisions, but statistics cannot be used to solve 
every problem. The point to remember here is that in decision mak¬ 
ing the tools of the statistician must be used as a basis for action if 
they are to be used profitably. Statistics provides the basis for action. 

If a businessman can’t act on a problem, then statistics can’t help 
him. 

The second factor which characterizes every decision problem 
may be summarized by the term “alternatives.” This refers to the 
fact that a decision problem exists only when more than one course 
of action is available. Specifying the existence of alternatives is sim¬ 
ply another way of saying that there is a managerial problem. If 
management has but one course of action open to it, no problem ac¬ 
tually exists and no decision is necessary. Management must take 
that course of action. 

Thus,^ the term decision” implies that management has various 
alternative courses of action open to it, and from these it selects one 
alternative. 

When applicable, statistics provides rational means of selecting 
this one alternative and rejecting all others. This procedure gives 
rise to the teinmnology—statistical decision theory . 

Suppose that we now turn to a few simple illustrations of busi¬ 
ness decision problems. For the present they should at least serve to 
stress how the concepts of action and alternatives appear in &ny de- 
cision-making situation. Ultimately we shall see how statistics can 
be used to help in solving these problems. 

Example l.S A company's division of production research develops 
what it thinks is a better and less costly gadget than those presently 
available. Several problems may immediately arise; each calls for 
some managerial decision. Should the gadget be marketed? If so 
what are the best markets? What is the best means of distribution? 

Of promotion? Of advertising? Each problem obviously has various 
alternatives associated with it. Incidentally, the company’s market 
research division, using appropriate statistical methods, can provide 
reliable answers to these questions if management wishes to use 
statistics as a basis for decision and action. 

Example 1.4 The section head of a company’s bookkeeping de¬ 
partment naturally wishes to have the error rate of his office’s post- 
ing operation as low as possible, even though he realizes some mis¬ 
takes are bound to occur occasionally. But he also wishes to keep the 
costs of doing this job at a minimum. Various alternatives are open 
to the section head, for he may choose the option of not checking any 
work done by the clerks, or he may choose to check all work done by 
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some clerks, or to check all work done by all clerks. He might, in 
fact, even decide to double-check or to triple-check all work. 

It must be assumed, of course, that he could carry out each of these 
possibilities. If he is short of help, he might not be able to check any 
work—he might have only one course of action—not checking— 
open to him. In such a case, no decision problem exists. 

Example 1.5 A manufacturer of oilcloth is faced with the problem 
of disposing of each roll as it is produced. The quality of a roll is 
measured by the average number of blemishes per square yard. The 
manufacturer considers three alternative courses of action: (1) to 
sell a roll as “first quality,” (2) to sell it as “second quality,” or 
(3) to scrap the roll. Thus the manufacturer’s decision and his course 
of action may vary from roll to roll, but in each case his decision will 
be based on the quality of the roll of oilcloth.. 

As you might guess, providing a method of estimating this quality 
or otherwise developing a basis for decision involves statistics. 

These three examples come from three different sectors of the 
business world. However, they all have various things in common. 
All problems are characterized by alternatives. They all call for deci¬ 
sion (the selection of one alternative) and action. In addition, they 
all can be treated by statistical techniques. As noted, these tech¬ 
niques will be studied in due time. DonT be impatient. 

1.4 States of the World 

In any problem certain alternatives are good. Some are bad. 
Some are better than others. Some are worse than others. But what 
do we mean by good, bad, better, or worse? Simply put for the mo¬ 
ment, our primary task in the next few paragraphs will be to try to 
give some meaning to the phrases good alternative or decision and 
bad alternative or decision. This is obviously a basic consideration, 
for in order to adopt rules for decision making we must have some 
sort of notion of why and when certain decisions are desirable. 

To be able to classify alternative courses of action as good or 
poor or inferior or superior or as something else, we must think in 
terms of the consequences of a given decision. For example, in any 
problem, if one of the decisions open to management will always 
yield better results than any other decision no matter what hap¬ 
pens, a wise course of action (and an obvious one) is to make that 
decision. We might say, that that particular alternative dominates 
all others. Then, no other course of action need be considered seri¬ 
ously. 

More often a different and more difficult situation exists. Under 
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certain circumstances one decision may be preferred to a second 
since the consequences associated with the first are more desirable 
However, under different conditions the first decision may be infe¬ 
rior to the second since the consequences associated with the second 
are more desirable. Thus, decision A may be better than decision B 
or decision B may be better than decision A, depending upon certain 
conditions. 

Consider the following nonstatistical but homey illustration: 

Example 1.6 Recently one of the authors had to visit a dentist 
whom he had not had the pleasure nor the pain of visiting previ¬ 
ously. He was told the office was at 654 Madison Avenue, New York 

City—somewhere in the 60’s; that is, between 60th Street and 69th 
btreet. 

He decided to ride the subway but absent-mindedly forgot to get 
more detailed directions. However, he did realize that the subway 
stopped at 59th Street and at 68th Street, among other places. 

He faced a decision problem with incomplete information His al- 
ternatives were to get off the train at 59th Street and to get off the 
tram at 68th Street. Which was a better alternative? This depended 
upon the location of the dentist’s office. If the office were at 62nd 
btreet a decision to get off the train at 59th Street would be prefer- 
able, if it were at 67th Street, the 68th Street stop would be better* 
and so forth. ’ 

The desirability of a particular decision depended upon a certain 
condition—the (unknown) location of the dentist’s office In the ab¬ 
sence of the information about this pertinent fact, the prospective 
patient had to. make his decision in the face of uncertainty about 
which alternative was the better. 

To be able to say with certainty which alternative course of ac- 
tion is best in a given decision problem, we ordinarily must know 
certain conditions about the situation we face. What are these con¬ 
ditions? In general, we will refer to them as the true state of the 
world. A true state of the world is a complete and accurate descrip¬ 
tion of a given problem situation. Note that in most problems there 
are many possible states of the world. Therefore, a basic question in 
decision making is which of these many states is actually the true 

one. Ordinarily, it is impossible to answer this question exactly_a 

matter we shall consider in the next section. First, consider two 
more examples. 

Example 1.7 A personnel administrator must decide whether or 
not to hire an applicant for a job. One way of defining states of the 
world m the problem is (1) potential success, and (2) potential fail- 
ure. Each of these states of the world describes the applicant’s char¬ 
acteristics which are pertinent to the decision problem 
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Example 1.8 An accountant wishes to value the end-of-year in¬ 
ventory of a wholesaler. States of the world are represented by possi¬ 
ble values of the inventory. One state of the world is $ 52 > 318 - “" 
other is $26,321.15. Still others are *121,300.01, $2,000, and *58,- 
111.58. Obviously there are quite a few possible states of the world. 

We can summarize some of our earlier remarks through the use of 
this new terminology. The desirability of alternatives depends upon 
which state of the world is true. Thus, if a decision maker knows 
which state of the world is true in a given problem, he should have 
no trouble in making a decision. For example, if the accountant of 
the last illustration knew that the true state of the world (the true 
value of the inventory) was $55,090.10, his decision on what value 
to attach to the inventory would be straightforward undoubtedly 
it would be $55,090.10. If the personnel administrator of the next- 
to-last example knew the state of the applicant, he would know 

whether to hire him or not hire him. 

Consequences refer to the results of making certain decisions 
when various states of the world are true. They are the outcomes of 
making a decision. Any outcome depends upon which alternative 
course of action is selected and upon which state of the world is true. 

Example 1.9 A manufacturer produces a certain type of fuse, 
whose quality he measures by whether or not it blows out at 20 am¬ 
peres. The manufacturer usually sells his product m lots of about 
1000 fuses, and he has the following alternative courses of action: 

(1) sell a lot as first grade for $100 (10* a fuse) with a double-your- 
money-back guarantee for each fuse that proves defective and 

(2) sell a lot as second grade for $85 (8%* a ^ use ) without any 

g States of the world may be described by the number of defectives 

m Xf he adopts alternative (1), the manufacturer will make $100 per 
lot less 20* per defective fuse (assuming that all defectives are re¬ 
turned under the guarantee). However, if he adopts alternative 
(2), the manufacturer will make a straight $85 per lot. 

Thus the outcomes for a few of the many possible states of the 
world and for the two alternative courses of action may be summa¬ 
rized as follows: 


State of the World 
(Number of Defective Fuses) 
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Thus, for any. given lot, if the manufacturer knew that only 10 
fuses were defective, he would choose alternative (1) and make 11 Of) 
less $2.00 (20* times 10), or $98. This is the desiwe aTemaS 
since under alternative (2) he would make only $85 

On the other hand, if he knew that 100 fuses we're defective he 
would choose alternative (2) and take the $85 since under alterna¬ 
tive (1) he would make only $80 ($100 less 100 times 20<0 

In genera 1 , the student should satisfy himself that if less'than 

, d itT are In * Sh ° uld be sold under alternative 

(1) , wh le if more than 75 defectives are in a lot, it should be sold 

under alternative (2) If'exactly 75 defectives ar’e in a lot, neith r 
alternative is preferable to the other. 

. ° ne ^ word on , this P^em. We have assumed that the gross 
Z m h r ° m ad ® c l uatel y measures the consequences of a deci- 

!! ° f ° Ur dlscussion is - therefore, based on the 

premise that no damage to the manufacturer’s reputation occurs be- 
cause of returned fuses. If this premise is not met, our sketch and 
analysis of the problem would be somewhat inadequate. 

In many problems it is quite difficult to measure consequences 
numerically. Often we can only evaluate them qualitatively. 

Example 1.10 The personnel administrator we met before (in Ex- 
ample 1.7) considers the consequences of hiring or not hiring an appli- 

he app , h< ’, a " t I s bred and he is a Potential success, the firm 
would be rewarded by having a competent employee. If the candidate 
is hired and he is a potential failure, the firm would be wasting time 
and money on his training. In addition, the hiring of a noncompe- 
tent person might have an adverse affect on the morale of other em- 
ployees and might result in incalculable damage to customer rela- 

In either event, you will observe, the consequences of the action 
o hiring depend on the true state of the world. Further, the conse¬ 
quences in this situation cannot easily be converted into dollars and 
cents but must be expressed qualitatively in terms of advantages or 
disadvantages to the firm. 6 

Similarly, if the action is not to hire the candidate and he is a po¬ 
tential success, the firm, as a consequence, would have to forego the 
services of a capable person. It might have to leave an important po- 
sition unfilled and thus interfere with the progress of an important 
program. On the other hand, if the candidate is a potential failure 
then the consequence of the action-not to hire-would save the firm 
the trouble and expense of trying to train a noncompetent person 
Again the consequences are related to the action taken and the true 
state of the world; and again the consequences are difficult to meas- 
ure numerically. 

However the thought processes of the personnel administrator in 
analyzing the consequences of his alternate actions may be summa¬ 
rized qualitatively as follows: 
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State of the World 

Alternative 

Potential 

Success 

Potential 

Failure 

Hire. 

Desirable 

Undesirable 

Not hire. .. 

consequences 

Undesirable 

consequences 

Desirable 


consequences 

consequences 


In short, the first step towards the solution of any decision prob¬ 
lem is to spell out the alternative courses of action, the states of the 
world, and the consequences associated with each possible outcome. 
This is a specific way of stating a common-sense rule the first step 
towards the solution of any decision problem is to recognize what 
problem exists. 

1.5 Uncertainty, Observations, and Sampling 

Whenever a decision maker knows which state of the world is 
true, he should be able to make a rational decision since a state of 
the world is supposed to be a complete description of the pertinent 
facts in a problem situation. 

Unfortunately, many business decisions must be made without 
knowing the true state of the world. They must be made in the face 
of incomplete information about which state is actually true. These 
problems are decided under uncertain conditions. A manufacturer 
cannot know the number of defective fuses in every lot upon which 
he must make a decision. A personnel administrator cannot know 
with certainty whether each applicant coming before him is a po¬ 
tential success or a potential failure. So it is in many other types of 
decision problems: there exists some degree of uncertainty about 
the true state of the world. 

The objective of using statistics is to recommend action in the 
face of this lack of complete information or in the face of this uncer¬ 
tainty. Put another way, one of the functions of statistics is to pro¬ 
vide information on which state of the world is true. This reduces 
the amount of the uncertainty that exists. 

And put still another way, the objective of statistics is to provide 
observations or numerical information to be used as the basis for de¬ 
cision. This ordinarily does not mean that statistics involves count¬ 
ing the number of errors in all paper work, or counting the number 
of defectives in all product, or measuring the diameters of all ball 
bearings in a lot, or asking all television viewers their favorite pro¬ 
grams. These usually are expensive or otherwise undesirable ways 
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of providing the required information. Rather, one may utilize the 
theory and the methods of statistics so that he may get the neces¬ 
sary information in the quickest, cheapest, and most efficient way. 
. Jt must alwa y s be remembered that the acquisition of observa¬ 
tions or data costs money, time, and effort. We must, therefore, al¬ 
ways weigh the cost of added information against the gains derived 
trom reducing the amount of uncertainty. It would obviously not 
be wise to spend $1,000 to gather information that might save $100 
or yield an added income of $100. We must always try to find the 
most economical method of reducing uncertainty. For this and other 
reasons we will consider ultimately, statisticians often find it de¬ 
sirable to provide only partial or sample rather than complete infor- 
mation. 


Example 1.11 In an earlier illustration, a manufacturer of fuses 
had to decide for each of his lots of product whether to sell under a 
guarantee or without a guarantee. One statistical solution to this 
problem might be for him to take a sample of 40 fuses from each 
o ® might then utilize this sample as a basis for action on the lot 
by testing the 40 fuses and selling the rest of the lot under the guar¬ 
antee^ and only if there are less than 3 defectives in the sample. 
We will return to this type of problem for the important details in a 

of samplffig 1 "’ f ° r th<! PreS6nt y ° U Sh ° Uld “ erely consider the idea 

Decision making based on sample information may seem like a 
risky business. That it is. Any time decisions are made on the basis 
of incomplete information there are risks of making an incorrect de¬ 
cision—a decision which would not have been made if complete in- 
formation had been available. 

Example 1.12 The manufacturer of fuses, last noted one para- 
graph ago, runs some risks if he adopts a sampling plan such as the 
one sketched in the last illustration. 

i .T he ^.t 1S i a nsk tllat lf , by chance the sample is poor, he will sell the 
lot for the lower price without the guarantee, although actually it is 
of good quality and should be sold under a guarantee. 

On the other hand, there is the risk that if by chance the sample is 
good, he will sell the lot with a guarantee, although actually it is of 
lower quality and should be sold without a guarantee. 

Despite the risks due to sampling, the manufacturer cannot get 
complete information by testing every fuse. Why? 

Example 113 A production manager in a plastics company must 
estimate the total number of widgets he should produce next week. 

He knows some widgets will be defective, so to produce 10,000 good 
widgets he must plan a total output of over 10,000 

His alternatives are 10,000, 10,001, 10,002, ..., 11,000 



L2 STATISTICS FOR BUSINESS DECISIONS [Oh. 1 

12 000 There are many risks of incorrect decisions in this prob¬ 
lem For example, one incorrect decision is producing 10,000 widgets 
when 10,500 will be needed. Another is producing 12,000 when 11,- 

111 will be needed. , 

Note, however, some incorrect decisions are not as bad as otners. 
This is judged on the basis of consequences, the costs of overproduc¬ 
ing and of underproducing. Thus, overproducing by 10 items ordi¬ 
narily would not be as bad as underproducing by 200 items. Cer¬ 
tainly overproducing by 10 items is not as bad as overproducing by 
100 items 

Also keep in mind this bit of philosophy: The more costly the con- 
sequences of an incorrect decision, the smaller the risk one should be 
willing to take of making that incorrect decision. 

Despite the risks of incorrect decisions whenever sample rather 
than complete information is obtained, sampling often is preferable. 
Savings in money and in time are but two of the reasons why one 
might prefer to collect partial data. A statistician recognizes t ese 
advantages but at the same time realizes the risks. That is his job. 

In most problems, therefore, he insists on a particular type of 
sampling which is termed “probability sampling.” Samples are 
drawn or considered in terms of the mathematical theory of proba¬ 
bility, a procedure which allows a decision maker to measure the 
risks inherent in the process of making decisions based upon incom¬ 
plete or sample information. This whole area represents another 
central theme which we will meet again and again in our studies. 

It should be remembered from the outset of our studies, how¬ 
ever that in some problems a statistician will not recommend the 
use of a sample. Rather he may say: “Make your decision without 
the benefit of observation. It is not feasible to attempt to get any 
information—it will yield less than its cost.” Or, m the other ex¬ 
treme type of problem, he may say: “Utilize a complete count and 
not a sample for this problem—it will be more economical. 

In conclusion it might be stated that one uses statistics to set up 
a procedure for obtaining and interpreting observations. However, 
any such plan of attack must be centered around the task of solving 
the business problem on which he is working. And remember that 
by the term “solving” we mean affording management a rational 

basis for decision. 

1.6 Testing and Estimation Problems 

At various points throughout this text we shall classify and define 
different types of statistical studies. This section presents the first 
of these classifications. It deals with the difference between statist 1 - 
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cal testing problems and statistical estimation problems. These two 
types of studies both have the same general purpose: providing a 
basis for decision and action on a business problem. The distinc¬ 
tion is important, however, in the design of a study—different plans 
must be made for testing problems than for estimation problems, 
and vice versa. 

The feature that distinguishes testing problems from estimation 
problems is the nature of the alternatives available to a decision 
maker. If the alternatives are nonnumerical or qualitative, the 
problem may be regarded as a testing problem. On the other hand, 
if the alternatives are numerical (quantitative, that is), the prob¬ 
lem may be regarded as an estimation problem. 

Thus, testing problems are characterized by the fact that man¬ 
agement has two or more qualitative alternative courses of action 
and decision available. The purpose of a statistical study is to decide 
which course of action appears to be better on the basis of numeri¬ 
cal observations . We have had several illustrations of testing prob¬ 
lems, the last of which was contained in Example 1.12. There the 
decision problem was to classify a lot of fuses as first grade or as sec¬ 
ond grade and accordingly to sell the lot with or without a guar¬ 
antee. The alternatives are nonnumerical; therefore, the problem is 
treated as a testing problem. 

Estimation problems are characterized by the fact that manage¬ 
ment has quantitative alternatives available. The purpose of a sta¬ 
tistical study is to recommend one of these numbers for use. Put 
another way, the purpose of a study in an estimation problem is to 
estimate some quantity. Example 1.13 dealt with an estimation 
problem—estimating the level of production. For another illustra¬ 
tion consider the following: 

Example 1.14 A small company manufactures various electronic 
tubes, including a special type of transmitter tube. At the end of the 
fiscal year the company accountant must find the approximate value 
of the tubes in inventory. The number of tubes on hand is known, but 
the accountant realizes that some tubes are not saleable because they 
are broken or defective. He concludes, obviously, that these tubes 
should not be valued in the inventory. 

In any event, the accountant’s alternatives in valuing the in¬ 
ventory are numerical, being typified by such numbers as $10,125, 
$235, and $321,123. The problem is an estimation problem. 

Incidentally, a statistical solution to this problem would be to 
base an estimate of the value of the inventory on a sample of trans¬ 
mitter tubes. If this solution were implemented, what risks would the 
accountant be taking? 

THE BUNT LIBRARY 
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Once again, remember that whether they are used in testing or in 
estimation problems, statistical theory and statistical methods are 
decision-making tools. This follows immediately from the thought 
that both testing and estimation problems are types of decision 
problems. 

1.7 The Principle of Planning 

A decision problem exists when there are two or more alternative 
courses of action. Decision is a summary word for the process of se¬ 
lecting one of these alternatives. The desirability of any decision is 
measured by the consequences of that decision. These consequences 
depend not only on the action decided upon but also on which state 
of the world is true. 

Many, if not most, business decisions must be made in the face of 
uncertainty. This means a decision maker is not sure which state of 
the world is true. This, in turn, means that he is not sure of what 
consequences will result from any decision he might make. Hence, 
he is uncertain about which decision to make. 

Uncertainty arises when a decision maker has incomplete infor¬ 
mation about which state of the world is true. Statistics is a field of 
study in which there exist theory and methods for reducing this un¬ 
certainty, within economic limits, by providing sample information. 

Sample information will not remove all the uncertainty from a 
decision problem. There will still be some uncertainty, hence some 
risks of incorrect decisions. But with the use of statistical theory and 
statistical methods, these risks can be measured and controlled. 

Because of the fact that incomplete information is the rule 
rather than the exception, too often the gathering of information 
becomes the thought-consuming problem to a decision maker. Too 
often the decision problem thus simply reverts to a data-gathering 
problem. Too often little thought is given to how data will be used 
once they are obtained. The unfortunate result is that too often 
data are gathered but, when available, they furnish no guide to ac¬ 
tion. 

Example 1.15 A consulting firm was retained by a municipality 
to assist in the relocation of residents whose dwellings were to be 
condemned as part of a slum-clearance program. The municipal of¬ 
ficials stressed their desire to carry out the relocation program with 
a minimum of disturbance. They felt that social and economic data 
about the residents were of the utmost importance in attaining this 
objective. 

Armed with this advice, the consulting group made two visits to 
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the area. It then seriously considered whether or not to set forth upon 
a third data-gathering expedition. 

At this stage, one of the authors was invited to enter the picture. 
He naively inquired if the consulting firm had planned what to do 
with the data after they were acquired. How would they aid in the 
relocation of families? He was surprised to hear that a plan for uti¬ 
lizing the data had not been formulated. Unfortunately, the munici¬ 
pal officials in presenting the problem had put such great emphasis on 
the paucity of data that the collection of data had become an end in 
itself. The failure to do a little a priori planning had resulted in 
much wasted motion and cost. 

Alternate programs of relocation should have been worked out be¬ 
forehand. The data necessary for the selection of an alternative 
could then be gathered. In any event, the data needed to reach a de¬ 
cision would have been more readily apparent. 

To gather data that will not help one make a decision is wasteful 
of time and money. In addition, it is alien to common sense. It is 
also alien to the field of statistics. Most statisticians view the proper 
function of their subject as relating decisions to the data one plans 
to collect before the data are actually collected. This philosophy 
calls for a decision maker to quiz himself as follows: “What will I do 
with these data?” “What decision will I make if I observe a certain 
result?” 

Statistics has in recent years developed to the point where the 
emphasis is not on data but on design for decision —designing a 
study so that it does more than furnish facts—so that it directly 
leads to decisions. This point of view, therefore, stresses a priori 
planning. 

This idea can be implemented in many studies by incorporating 
a statistical decision rule into the plan of attack. In such a study the 
decision maker formulates his possible decisions a priori, that is, be¬ 
fore the actual observations are made. However, these a priori de¬ 
cisions are conditional upon what he observes. A simple form of 
a statistical decision rule is: “Take a sample of such a size in such a 
way; if we observe so-and-so in our sample, then we will do this; 
otherwise we will do that.” 

To be more specific about such, so-and-so, otherwise, this and 
that, let's look at two illustrations covering a personal decision prob¬ 
lem and a personnel decision problem in that order. 

Example 1.16 A homey example of an a priori decision rule was 
recently implemented by one of the authors and his wife. Expectant 
parents, they faced the decision problem of naming their expected 
arrival. They made their decision a priori, relating the action to be 
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taken to an observation which was to be made subsequently—the sex 
of the child. Their rule was as follows: If the baby is a boy, name 
him Louis; if the baby is a girl, name her Laura. 

Their decision rule was somewhat incomplete, but only in the 
sense that they did not provide an action for twins, triplets, etc. For¬ 
tunately, for more reasons than one, their rule did suffice. 

Of course, one need not be a statistician to use this type of rule. 
Thousands of expectant parents have used a similar plan—it is quite 
commonplace, to say the least. 

Example 1.17 A personnel administrator may use the following 
decision rule in deciding whether or not to hire secretarial help: If 
the girl can type at least 60 words per minute and take shorthand at 
the rate of at least 100 words per minute and score over 75 on a given 
aptitude test, hire her; otherwise don’t hire her. 

Note in particular that the rule relates decisions to observations 
before the observations are made, as all good decision rules must. 

Throughout this book we will be studying the theory and the 
methods of constructing rational decision rules. The word “rational” 
is important in this sentence—it means that we will be studying 
ways of constructing rules to achieve certain managerial objectives. 
We will learn more about this as we go along in our studies. 

1.8 The Principle of Teamwork 

Planning is a basic principle of statistical applications. Who is 
responsible for this planning, and how is it achieved? Briefly, plan¬ 
ning is a team function and may be achieved only through team¬ 
work. 

Most business problems are solved not by one man but by a 
group of men working in concert. Such a group, insofar as they have 
identical interests in mind, may be thought of as a team. 

Depending on the nature of the decision problem, the general 
manager, the production expert, the marketing man, the sales man¬ 
ager, the advertising director, the controller, the internal auditor, 
and/or the credit manager may be important players on a given 
team. Throughout this book, we will take the liberty of grouping all 
these players together and, for convenience, call them management. 
Management is that group which directs the strategies and makes 
the decisions of a firm in the game called business. 

In some problems—those that are statistical—a statistician is a 
member of the team. He performs a useful function. In this text, 
we will, of course, be interested primarily in what position the stat¬ 
istician plays on this team and what he can contribute to the play- 
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ing of the game of business. We will often refer to management 
and the statistician because our interest is primarily in statistics. 
But this should not imply that the latter is a separate entity or the 
individual star. A statistician works with others who direct the 
team/s strategies, and he works for the team. Remember, the statis¬ 
tician is an important part of a business team, but he is important 
only as part of that team. 

Sometimes one player may act as a statistician and also as, say, 
a marketing expert. Sometimes, in fact, one player may be the 
whole team. In any event, we are going to study the statistical part 
of the teamwork that is required to solve many business problems— 
to make decisions in the face of uncertainty on the basis of numeri¬ 
cal information. 

We have stressed the idea of a team because co-operation is the 
key to the planning of any statistical study. Many times the nature 
of a business problem may, at first, be somewhat vague. The sub¬ 
ject matter experts on the business team and the statistician must 
then sit down for a conference and discuss in detail the nature of 
the problem. The alternatives must be listed, and the possible con¬ 
sequences of each alternative thought out. States of the world must 
be defined. Given a clearer view of the whole problem, the team 
must then decide if the problem is statistical. If so, then they must 
also decide upon the appropriate statistical framework. This in¬ 
volves other elements which the team must formulate, and to which 
we shall turn in Chapters 2 and 3. 

1.9 You Should Now Know That 

Statistics are data, but, in addition, statistics is a field of knowl¬ 
edge. 

The field of statistics involves methods and theory as they are ap¬ 
plied to numerical data or observations with the objective of mak¬ 
ing rational decisions in the face of uncertainty. 

Statistics provides a basis for action in business decision prob¬ 
lems. 

A decision problem exists when two or more alternative courses 
of action are available. In many problems statistics provides an ob¬ 
jective basis for selecting one of these alternatives. 

A state of the world is a complete description of the pertinent 
facts surrounding a given problem situation. 

Consequences are the results of making a certain decision when 
a certain state of the world is true. 
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In many problems a decision maker may be uncertain about 
which state of the world is true, hence uncertain about which de¬ 
cision to make. 

Statistics provides methods of making sample observations 
which reduce the uncertainty connected with business problems. 

Statistical studies deal with either testing problems or estimation 
problems. 

The proper function of statistics is to relate decisions to the data 
one plans to collect befove the data are actually collected. This is 
the principle of planning. 

Statistical studies usually must be formulated by management 
and the statistician. This is the principle of teamwork. 

1.10 You Should Now Be Able to Solve 

1. Some states require a minimum bodily injury liability coverage 
on automobiles licensed in their state. Many automobile owners, 
however, decide to carry as much as 10 times the amount of mini¬ 
mum insurance. 

Discuss the nature of the reasoning that results in the decisions 
of these individuals to carry large amounts of insurance. Frame 
your answer in terms of alternatives, states of the world, uncer¬ 
tainty, consequences, and numerical information. 

2. You have been accepted for admission to three graduate schools 
to which you have applied. You have a month within which to 
choose one, or none, of them. Is this a statistical decision problem? 
Discuss the problem in terms of alternatives, states of the world, ac¬ 
tion, numerical data, consequences, and uncertainty. 

3. Discuss the adage “don’t cross your bridges until you come to 
them” generally in terms of statistical decision theory and particu¬ 
larly in terms of the principle of planning. 

4. Every month the Department of Commerce conducts a sample 
survey of retail sales in the United States. Retail sales for March, 
1958, were reported to be $15.6 billion. 

a) Is this an illustration of a testing or of an estimation problem? Ex¬ 
plain. 

b) What are the alternative courses of action in such a problem? 

c) Is there any uncertainty in this decision problem? Why? Does the 
sample survey eliminate the uncertainty? Why or why not? 

5. You must decide between going to the movies with a friend to¬ 
night and studying for a statistics quiz on Chapter 1 that may be 
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given tomorrow. Consider this problem in terms of the concepts and 
principles of decision theory. 

Make a decision and explain the reasoning process which you 
used in making the decision. 

6. A department store's management wants to study, on a day-to- 
day basis, the records and receipts of the clerks responsible for re¬ 
funds. The internal auditor asks what the purpose of the study will 
be (will they retain or fire certain girls or maintain or change the re¬ 
fund system, etc.). Top management says, “Let's do the study first, 
and then we'll formulate a course of action." Is there anything 
wrong with this answer? Why or why not? 

7. Select some decision problem with which you are familiar from 
your own field of specialization. Discuss in terms of alternatives, 
states of the world, consequences, uncertainty, etc., and in terms of 
how you think statistics might help in arriving at a decision in the 
problem. 

8. An investor in the stock market must decide whether to hold or 
to sell his portfolio of stocks. Discuss this situation as a decision 
problem. 

1.11 You Will Also Find That 

The ideas of decision, alternatives, action, et. al., are covered lu¬ 
cidly in: 

Bross, Irwin D. J. Design for Decision , pp. 1-32. New York: Mac- 
millian Co., 1953. 

Decisions in the face of uncertainty, states of the world, conse¬ 
quences, and related topics are discussed very clearly in: 

Savage, Leonard J. The Foundations of Statistics, pp. 6-10, 13-17. New 
York: John Wiley & Sons, Inc., 1954. 

Some further examples and remarks about the nature of statis¬ 
tics and statistical planning are given in: 

Sprowls, R. Clay. Elementary Statistics, pp. 1-8. New York: McGraw- 
Hill Book Co., Inc., 1955. 

For some basic thoughts on what statistics is and/or what sta¬ 
tistics are, the following are key references: 
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CHAPTER • 2 


Statistical Populations 


2.1 Defining the Problem Statistically 

Our first chapter was devoted to a discussion of some of the as¬ 
pects and principles of decision making. Many of the ideas are 
rather general—they apply to a variety of business and personal 
problems. Naturally, there are many such problems. There are also 
many ways of arriving at decisions. 

As was pointed out, one of the ways of arriving at decisions in 
some problems is through the effective use of statistical methods 
and statistical theory. Thus, if any problem is such that there is un¬ 
certainty about the true state of the world and numerical data will 
help reduce this uncertainty, then that situation presents, at least in 
part, a statistical decision problem. And if one is faced with a sta¬ 
tistical decision problem, the next step in the decision-making proc¬ 
ess is for management and the statistician to translate it into sta¬ 
tistical terms. 

Incidentally, this translation is not wholly a statistical job. It is a 
team job. It requires the combined efforts of the subject matter ex¬ 
perts and the statistician who are working on the given problem. 

The statistical definition of a decision problem usually involves 
two integral parts. If you will tolerate the use of technical terms for 
a moment, we may call these two things: 

1. Defining the statistical population , and 

2. Specifying the decision-parameter. 

This chapter is devoted to developing the idea of defining the 
statistical population. In statistics the term “population” is a spe¬ 
cial word with a special meaning—you should immediately forget 
the conventional meaning you attach to this word. In its place, re- 
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member that a population is the totality of all pertinent observa¬ 
tions that might be made in a given problem,. You might also make 
note of the fact that an often-used synonym for population is uni¬ 
verse. On the other hand, if only a part of a population is selected in 
a study, that part is called a sample. 

The second important part of the statistical definition of a prob¬ 
lem, specifying the decision-parameter, will be considered in some 
detail in Chapter 3. As you will see, it involves methods of describ¬ 
ing the state of the world in a problem; that is, it involves ways of 
statistically summarizing the pertinent facts about a population. 

. Exam P le 2 -l A marketing research study is planned to provide 
information for a decision as to the amount of money a company will 
spend for a local advertising campaign to promote its oleomargarine. 
Included m the type of data desired is the amount of oleomargarine 
used by the families in the specified area. 

The statistical population consists of the different amounts of ole¬ 
omargarine the various families use. If the population could be made 
known, it might be written (in terms of the number of pounds per 

week) as 0, 1.5, 0, 1, .25, 0, etc., where each number is associated 
with a given family. 

Thus, the population for this problem is a set of numbers—each 
number is a possible observation and one bit of information. The 
totality of these numbers, or observations, or bits of information is 
the population. ’ 

In practice, it might be costly and time consuming to study the 
population completely. However, some information (from a sample 
of the families) might be obtained by the use of statistical methods 
and a decision made on the basis of the observations in the sample. 

A decision-parameter, incidentally, is some characteristic of this 
population; for example, the average amount of oleomargarine used 
by all the families, or perhaps the proportion of nonusers. But, as 
noted, let’s save this for the next chapter. 

Although we now are going to spend a full chapter on popula¬ 
tions, statistics deals primarily with the making of decisions On the 
basis of samples. Complete or population information is often un¬ 
obtainable. Although there are exceptions, usually one must settle 
for sample or partial data as a basis for decision making. 

Despite these facts, it is worthwhile, at this stage of our studies, 
to ask ourselves what we would do if complete information—the 
population—were available or could be made available. It is worth¬ 
while because this is a question which management and the statis¬ 
tician must invariably ask in the planning stage of any proposed 
statistical study. 
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First of all, an answer to this question points up clearly what type 
of information is needed. Whether this information can be obtained 
is another matter. After all, before thinking about whether you can 
get certain information, you should think about what information 
you need to make the decision that you are facing. Secondly, think¬ 
ing about complete information focuses attention on the decision 
problem, per se. It directs attention to possible actions and not to 
the gathering of data which may turn out to be useless when col¬ 
lected. 

Thus, thinking about complete information in the planning stage 
of a study is a fruitful procedure. It emphasizes what we have called 
a priori planning, one of the cardinal principles of statistics. 

2.2 Elementary Units and Their Characteristics 

Let’s examine the whole idea of observations in more detail. 
Many statistical studies involve observation of characteristics of 
what may be called elementary units . For example, one decision 
problem may involve observations of the quality of fuses. Thus, an 
elementary unit is a fuse; the characteristic to be observed is its 
quality. Alternately, a problem may require observations of the net 
income of corporations in 1958. In this case, an elementary unit is a 
corporation; the characteristic to be observed is net income in 1958. 

Example 2.2 To extend our introduction to the present topic, we 
may consider the following list which presents a variety of elemen¬ 
tary units and some of their characteristics which may arise in var¬ 
ious statistical decision problems: 


Elementary Unit 

An apartment. 

A dwelling unit. 

A family unit. 

A student. 

A person. 

A firm. 

An account receivable. 

A common stock. 

A product. 

A worker. 

An employee. 

A bank. 

A radio tube. 

A steel rod. 

A steel rod. 

A steel rod. 


Characteristic 
Availability 
Type of heating 
Annual income 
Quiz grade 
Marital status 
Net profit per sales dollar 
Amount due 
Dividend rate 
Price 

Employment status 
Absences in one month 
Number of depositors 
Time to failure 
Tensile strength 
Diameter 
Weight 


We have already defined a statistical population as the total set of 
observations that may be made in a study. In our new terminology, 
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you may also think of a population as the complete set of elemen¬ 
tary units. In any event, remember that those observations actually 
made in most statistical studies are only a sample—a part, but not 
all, of the population. 

Example 2.3 The marketing research study planned to provide a 
basis for action on an advertising campaign in Example 2.1 involved 
the amount of oleomargarine used by families located in a certain 
area. 

Note that: 

1. The elementary units in this study are the families in the speci¬ 
fied area. 

2. The characteristic of interest for each family is the amount of 
oleomargarine it uses in a given week. 

3. The whole set of observations of this characteristic is the popu¬ 
lation under study. 

4. Generally a statistical study would involve observing only a 
sample. Why? 

Example 2.4 A company accountant must decide whether or not 
to check this week’s payroll vouchers for any numerical or other mis¬ 
takes that may have occurred. 

Note that: 

1. In such a problem, the elementary units are all the payroll 
vouchers under consideration. 

2. The characteristic of interest for each of these units is its qual¬ 
ity. This, in most applications, may be defined as either “in er¬ 
ror” or “correct.” 

3. Once again, the total set of these observations is the population. 

4. Decisions would ordinarily be based on a sample of these ob¬ 
servations. 

Example 2.5 A manufacturer receives a large quantity of a spe¬ 
cial machine bolt that he uses in his production process. He must de¬ 
cide whether or not to accept the lot. He can only use a bolt if its di¬ 
ameter is between .2405 inches and .2415 inches. 

Note that: 

1. An elementary unit in this study is a bolt. 

2. The characteristic of interest for each bolt is its diameter. 

3. The whole set of diameters is the population. 

4. Usually, however, a decision will be based on a sample of the 
diameters. Why? 

Also note that an alternative definition of the characteristic of an 
elementary unit is available for this problem. Those bolts between 
.2405 inches and .2415 inches might be classified as “good” and oth¬ 
ers as “defective.” Then this set of observations is the population, 
and a sample of these observations may be used as a basis for deci¬ 
sion. 
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The last illustration points up the fact that in a given problem 
there may be different ways of specifying the population, simply 
because there may be different ways of observing characteristics. 
However, while the choice of an appropriate characteristic, and the 
choice of the appropriate elementary unit as well, depend upon 
many factors, they usually follow from the nature of the decision 
problem. Nevertheless, it should be stressed that these choices must 
be made early in the planning stage of a study by the subject matter 
experts and the statistician working in concert. 

Example 2.6 A household appliance company wishes to deter¬ 
mine, within a certain area, the number of dwelling units in which 
there are electric refrigerators, in which there are gas refrigerators, 
etc., so that they may better plan their sales campaign in that area. 

A sample study of dwelling units is considered. Thus, if the study 
is to be made, it seems logical to define a dwelling unit as the elemen¬ 
tary unit. This is a start—but we must be more specific about the 
definition of the dwelling units. Should apartments be included? 
What about hotel rooms with kitchen facilities? Should lodging or 
boarding houses be included? 

Answers to these questions must be provided in the planning stage 
of the study. The elementary unit must be defined very specifically 
before the survey itself is even begun. 

There is an important lesson to learn from this illustration. Al¬ 
though the definition of the elementary unit is an important con¬ 
sideration in a statistical study, it is not exclusively a statistical prob¬ 
lem. Its solution depends, of course, upon what type of information 
is wanted by the company, and this must be clearly described by the 
marketing, advertising, and sales experts who are working on the 
problem. 

Example 2.7 A common problem in production control involves 
deciding whether to let a machine continue to operate as it is or to 
stop the machine and reset it. It is also commonplace for such deci¬ 
sions to be based on a sample of product taken from that already 
produced by the machine. What should be observed by, for example, 
the shop foreman when he samples the product? The elementary unit 
is a part, or a can, or a fuse-—-whatever product is being pioduced. 
But which characteristic of the product is of interest? 

If the product is a steel rod, such quality characteristics as break¬ 
ing strength, diameter, weight, or length might be observed. If the 
product is a can of fish, the weight, the aroma, the flavoi, or the ap¬ 
pearance of the fish might be checked. Should all of these character¬ 
istics be observed? Or, if not, how are the characteristics of interest 
to be selected? 
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A general rule is to observe those characteristics which, in the past, 
have led to the most expensive and serious losses when they were not 
being produced according to specifications. Whatever characteristics 
are decided upon, however, they must be decided upon a priori—in 
the planning stage of a study. 

As these last two examples indicate, the selection of an elemen¬ 
tary unit and of the appropriate characteristic is important to point 
a study towards a good solution. Remember, the aims of statistical 
studies generally involve finding out something about the character¬ 
istics of elementary units. Why? In order to make a rational decision 
despite some uncertainty. Unless we know the elementary units and 
the characteristics in which we are interested, a study may very well 
be a waste of time and money. 

2.3 The Frame 

In the last section we considered two important points that in¬ 
variably must be dealt with when one defines a population. How is 
an elementary unit to be defined? What characteristic is to He ob¬ 
served? Answers to these questions are necessary before a study can 
proceed. In addition, those in charge of planning the study must then 
consider whether or not their definition of the population is opera¬ 
tionally feasible; whether or not there exists any way of access to the 
population. Having considered where they want to go, they may 
need, so to speak, a “road map.” In statistics such a road map is 
called a frame. 

Access to the population is no problem in many applications of 
statistics. Thus, when you must decide whether or not to accept a lot 
of product, the lot is usually before you, more or less accessible and 
available for study. In other cases, however, access to the population 
presents much more difficult problems. For example, it is easy to de¬ 
fine a population as the brands of all radios in use. But where are all 
these radios? Can we identify their locations in terms of a frame? 
That is now the important question. 

We may formally define a frame as a list or map or some other 
concrete way of identifying all of the elementary units to be covered 
by a statistical study. Such a “list” is quite necessary in order to carry 
out many studies. Unless we have identified where and from whom 
we can gather the information we desire, how can we possibly ob¬ 
tain it? 

Example 2.8 A controller must analyze his firm’s accounting op¬ 
erations over the past several years to derive certain financial infor- 
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mation not ordinarily tabulated; e.g., the number of checks drawn 
for amounts under $2.00. The analysis is to be the basis for a deci¬ 
sion on whether or not the company should revise parts of its ac¬ 
counting system. 

Part of the study can be carried out through an analysis of can¬ 
celed checks. Thus, an elementary unit is a canceled check. One char¬ 
acteristic of interest is whether or not the amount of the check is less 
than $2.00. It is noted that the canceled checks have been filed by 
number; and that there are up to 1,250 checks per file drawer. A 
frame for the proposed study is easily constructed to cover all checks 
from July 1, 1956, to June 30, 1958 (the dates set for the study). It 
may be written down quite simply, as follows: 


File Drawer 
Number 

Includes Checks 
Numbered 

Number of 
Elementary Units 

112 


4086-5000 

! 915 

113. 


5001-6250 

1,250 

114 


6251-7500 

1,250 

115 


7501-7624 

124 




3,539 


This frame, you should note, identifies each and every elementary 
unit—all 3,539 checks—from July 1, 1956, through June 30, 1958. It 
therefore shows the location of all the elementary units of interest, 
thereby operationally defining the statistical population and provid¬ 
ing a basis for implementing the proposed study. 

Example 2.9 An accountant must value, at retail, the end-of-year 
inventory in a retail shoe store for tax purposes. He defines a pair of 
shoes as an elementary unit; the retail price of a pair of shoes as the 
characteristic of interest. Then a frame may be developed in terms of 
a series of diagrams, each diagram identifying shoe boxes or pairs of 
shoes in a certain section of the store. 

Example 2.10 The Turkish government conducts a quinquennial 
census of population. In many Turkish villages, however, there are 
no streets and houses have no numbers. How can frames be con¬ 
structed for the study? One solution, used prior to 1955, was that 
every mayor had the responsibility of numbering every house and of 
preparing a list of the numbers identifying their location. These lists, 
then, represented frames of dwelling units in the villages. 

It is important to remember that a frame is essential for a census 
(complete count) as well as for a sample study. The only difference 
in the two situations is that for a census we make observations on all 
the units included in the frame, while for a sample we observe only 
some well-selected portion of the units. Thus, to conduct a complete 
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or a sample study of faculty members in a university, there must be 
a frame to specify and to identify the teachers to be included. Other¬ 
wise a survey would be only a hit-or-miss affair. This same idea holds 
true for every study. 

Many times a frame is readily available, or at least readily con¬ 
structed, for a statistical study. However, occasionally it is difficult 
to define a satisfactory frame—particularly for statistical studies 
carried out over a wide geographical area. This is one basic difficulty 
in marketing and advertising research covering a regional or a na¬ 
tional market. 

Some common sources of frames for “widespread” surveys are 
telephone directories, customer lists, lists of voters, automobile regis¬ 
trations, tax assessor’s lists, and city directories. However, with the 
possible exception of the last two, such frames are adequate only for 
limited purposes. Usually, they are not satisfactory for statistical 
studies of the general public. Therefore, extensive use often is made 
of geographic areas to identify groups of elementary units rather 
than individual elementary units. The frame in such a case may be a 
series of maps of geographic areas. Such frames include detailed 
maps of counties, standard metropolitan areas, census tracts, city 
blocks, and census enumeration districts. Each of these areas may 
serve as a way of identifying, or at least locating, dwelling units, 
places of business, or consumers. 

Frames are subject to various imperfections, and, in practice, it is 
is very often difficult to avoid all of these potential sources of trouble. 
It is essential, therefore, to determine at the outset of a statistical 
study whether a given frame is satisfactory. 

For one thing, a frame is incomplete if it does not include every 
elementary unit that should be included within the scope of a study. 
We may or may not be aware of the fact that a frame is incomplete. 
This imperfection ordinarily will be more serious if we are not aware 
of it. For one thing, incompleteness in a frame is unlikely to be dis¬ 
covered in the course of a study. For another, it can be a serious de¬ 
fect if the incompleteness is confined to units possessing a special 
characteristic, for these units would, as a consequence, be under¬ 
represented in a study. 

At least when we are aware of incompleteness we can decide 
whether or not it will invalidate the study, and act accordingly. You 
see, at times it is not possible to construct a frame to include all 
elementary units, and a frame with some units omitted must be 
considered. We may designate the population represented by an in- 



28 


STATISTICS FOR BUSINESS DECISIONS 


[Oh. 2 


complete frame as the “working” population as opposed to the pop¬ 
ulation we are interested in—that is, the “target” population. It is 
important to recognize that any survey results, if and when obtained, 
will refer only to the working population. The key question is then: 
“Is this information adequate?” 

Example 2.11 A department store considers expanding its T.V. 
service department and offering, among other things, a new one-year 
service contract. How many new customers and what type of sets 
(new or dilapidated) will the service contract offer attract? Answers 
to these questions will provide a basis for deciding the extent of the 
department’s expansion and the rate to be charged for the service 
contract. 

Management desires to test the offer on a hundred or so customers 
before presenting it to the public at large. From what group should 
the sample be selected? What is the population? Is it represented by 
all T.V. set owners in the New York metropolitan area? If so, a com¬ 
plete frame would be almost impossible to obtain. 

However, suppose that the store does have a list of persons who 
have purchased their sets there and lists of other persons who have 
employed the store’s service department as well. Does this provide a 
satisfactory frame even though it restricts the scope of the study? 
This is a question which management and the statistician must de¬ 
liberate very carefully in the planning stage of the proposed study. 

As contrasted to incompleteness, a frame also may be impel feet 
because it contains duplicates of certain elementary units.. A study 
based on that frame may be biased. Of course, when duplication is 
present, it often can be corrected by an examination of the frame, al¬ 
though such an examination may be a tedious and costly process. 

The success or failure of many statistical studies depends on the 
type of frame that is available. For one thing, the definition of the 
population is tied closely to the frame selected. Therefore, it is un¬ 
wise to plan a study in any detail until one has knowledge of the 
availability and of the accuracy of a frame. Needless to say, if a rea¬ 
sonable and adequate working population, defined in terms of a 
frame, cannot be found, then a proposed study must be discontinued. 

2.4 Enumerative and Analytical Studies 

A statistical population, as we have defined it, consists of a set of 
observations pertinent to a decision problem. These observations 
may be finite or infinite in number, depending upon whether a statis¬ 
tical study is enumerative or analytical. 

Most statistical studies may be thought of as having one of two 
general purposes. In the first place, a study may be designed to pro- 
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vide a basis for action on a situation as it exists. This type of study is 
called an enumerative study. y 

On the other hand, a study may be designed to provide a basis tor 

>" cUr, °T ,k * *"*•» ™ ™, 

type of study is called an analytical study. 

Thus, the key to determining whether a study is enumerative or 
analytical is an answer to the question: “Are we going to take action 

sHuationr tl0n ^ * 6XiStS ° r ° n the ^eh give rt TZ 

Exompte 2.12 Ordinarily wool is imported in highly compressed 
bales. An importer, receiving a shipment of these balls, Lst 7as S ft 
and sell the wool as first grade, or second grade, etc. Thus, for each 
bale he faces a decision problem. ’ aC1 

The gra.de of a bale of wool must be determined by inspection For 
reasons of economy and expeditiousness, however, the importer 
loS’lf deci , slon + . for e ach bal e of wool on the basis of samples of 

lull f Z°° l ? th ®* han 0n the basis of a complete inspection 
. M , Pertinent at this point in our studies is the thought that the 
importer uses a sample as a basis for acting on a situation (a bal of 
ool) as it exists. No action is being taken on the process or causal 

factors m the decision problem as presented. Thus, the problem pre¬ 
sents an enumerative study. piuuiem pre- 

An enumerative study is designed to provide a basis for action on 

tions^lJT nth l Xt de u S With a fiX6d and finite set of observa¬ 
tions—m other words, with a finite population. 

. An analytical study, however, deals with a process, and any exist¬ 
ing information must be regarded as a sample from that process The 
population for such a study would include all observations that could 
be made on the process if it continued to operate indefinitely under 

finh? Th C0ndltl0nS ', We T ay C ° nSider this Set of obse rvations as in¬ 
finite. Thus, an analytical study deals with an infinite population. 

Example 2.13 An industrial engineer must estimate the average 
S a - P artlcular Pohshing operation takes. This information will be 
used, m turn,, for determining piecework wage rates. 

A m y / t£ 5 tlS ^, Cal study of the time taken t0 Perform this operation 
would deal with a process Hence, the study would be analytical. 

r T. h -?.l etlCa i, y th ® P° hshm g operation could be repeated without 
unit. Thus, the number of performance-times which could be ob¬ 
served by the industrial engineer also is infinite. This infinite set of 
performance-times is the population—an infinite population. 

, tefi U -? 1Sht note . that when a stud y is analytical and a population 
° ™ lte ’ a complete count is impossible. Such a population can 
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never be studied completely. But this should not disturb you. It 
simply means that in analytical studies we must settl « for ® 

data However, even in enumerative studies we may prefer to settle 
for sample data, not because complete data are ™ ob J a “ abl ® ^ J? d _ 
cause sample data are usually more economical all things consul 
ered. Sampling is absolutely necessary only m analytical studies, bu 
verv often it is wise in enumerative studies as well. 

Whether a study is enumerative or analytical and, hence w e er 
a population should be regarded as finite or infinite depends on the 
action to be taken in the decision problem. Is action to be taken on 
the existing situation? Or on the factors giving rise to the situation 
This fact may be further emphasized by the following illustration 
which shows that the same statistical information may be used for 
an enumerative or for an analytical study, depending upon one s 
decision problem. 

Example 2.14 The machine department of a manufacturmg com-, 

parteT* 3 the%tudy ^nuLeralive 8 oflnahSn ThtiTep^ds on’ the 
purpose of the study—on the problem for which the sample is to be 

US< The~eo\°lamS^om a lot may be to decide whether to 
scrip the lot, or to inspect the lot fully and weed out all def ®® tl '?®> 
or to send the lot into production, or to do something else with that 
particular lot. In such a case the statistical study would be enumera¬ 
tive Action is to be taken on a situation as it exists. The population 
is finite, being represented by the 1,000 parts m the lot being dealt 

^Alternately the purpose of sampling from a lot may be to decide 
whether or not to adjust the machine department s production proc¬ 
ess In that case the study would be analytical. In fact, you may note 
that even all of the parts in a given lot present only a sample from 

the process. Why? 

In Chapter 1 we distinguished between testing and estimation 
problems as one way of describing statistical studies. In this section 
we have classified statistical problems m another m ry-as en™- 
tive or analytical. These two classifications are independent A study 
maybe enumerative and involve testing or estimation or it may be 
analytical and involve testing or estimation. Don t confuse the two 

Cli Qne C last 0 comment is in order. Whether it is enumerative or ana- 
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lytical, the purpose of a statistical study is to provide a rational basis 
for action m some decision problem. In both types of studies, statis¬ 
tics is a means to rational decisions in the face of uncertainty. 

2.5 Multiobservational Problems 

Thus far we have concentrated on discussing and illustrating types 
of statistical studies in which only one characteristic is to be ob¬ 
served. Such studies are important. They arise frequently in practice 
Then, too, they present relatively simple situations. For these rea¬ 
sons, more often than not we will deal with single-observational 
problems. Other practical problems, however, are multiobserva¬ 
tional. That is, two or more characteristics of a single elementary 
unit are observed. We shall also have occasion to refer to some of 
these problems. 

Multiobservational problems arise in two types of situations. 
First of all, some statistical studies are not designed to afford a de¬ 
cision on only one problem. Rather, in the interests of economy as 
well as other considerations, they are designed to provide bases for 
decisions on a whole set of problems. This is typically the case in 
marketing and advertising research. In such studies, several char¬ 
acteristics may be of interest since there are several problems Sev¬ 
eral characteristics necessitate several different observations on any 

one elementary unit. Hence, there are several populations of interest 
in the study. 

Example 2,15 A real estate corporation is planning to construct 
several new apartment buildings. Several decision problems arise in 
this endeavor. Should they install electric or gas ranges? Should the 
kitchen floor covering be linoleum or resilient floor tile? Should the 
kitchen cabinets be wood or metal? 

The company officials plan to do a study to provide a basis for 
these and other decisions that must be made. They will ask tenants 
m present company-owned buildings to state their preferences on 
ranges, kitchen floor coverings, kitchen cabinets, etc. 

Thus, each tenant will be asked several questions. There are for 
each elementary unit—a present tenant—several observations to be 
made. These will provide bases for several decisions. The studv pre¬ 
sents a multiobservation problem. 

Multiobservational problems also arise because occasionally, even 
though you are dealing with only one problem, your study will pro- 
vide more information if you observe more than one characteristic. 
Several bits of information are, in this case, all pertinent to the one 
decision problem. 
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Example 2.16 A personnel administrator decides whether or not 
to pass executive trainees on the basis of several job ratings and 
tests. Thus, not one but several different characteristics of a trainee 
are observed. The problem is multiobservational. 

2.6 Variates and Attributes 

As has been noted, in defining a population we must specify the 
elementary units and indicate the characteristic (or characteristics) 
to be observed. 

Our observations of these characteristics can be divided into two 
main classes: quantitative and qualitative. A quantitative observa¬ 
tion is one which is expressed numerically. As the name implies, it 
involves some quantity. The unit of measurement may be a dollar, 
or an inch, or a number of employees. But quantitative observa¬ 
tions are, by definition, always numerical. Also note that the values 
which the quantitative observations may assume are called variates. 

Example 2.17 Consider the following elementary units and their 
characteristics, all of which may be observed quantitatively. Note 
also the unit of measurement and the specific numerical example of 
a variate. 


Elementary Unit 

Characteristic 

Unit of 
Measurement 

V ariate 


Annual income 

Dollars 

$5,340.40 


Time to failure 

Hours 

953 hrs. 

An pmnlovee. 

Monthly absences 

Number 

3 

A nrpfpTrpd stook ... ... 

Dividend rate 

Per cent 

5.5% 

XI Ui Civil vvl ivuiv ....... 

A hall bearing . 

Diameter 

Inches 

.06254" 

A. firm. 

Sales-inventory ratio 

Per cent 

752% 

A machine. 

Age 

Months 

75 


Quantitative observations may be further grouped as discrete or 
continuous. A discrete quantitative observation is one that can take 
only a limited number of values on any measuring scale. Annual in¬ 
come can at best be measured to the nearest cent. It cannot assume 
values ending in 1 / 2 $ %$• The number of production workers em¬ 

ployed by a firm can be an integral value, but it cannot be fractional. 
On the other hand, a continuous quantitative observation theoreti¬ 
cally can assume any value on a scale. For example, measurements of 
length, weight, and time can be thought of as continuous. Thus, in 
theory, the diameter of a ball bearing might be measured to the 
nearest .001 inch, to the nearest .0001 inch, to the nearest .00001 
inch, or as precisely as one wished. 
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In practice, of course, we may measure only as closely as our 
most precise measuring instruments will allow. As this remark im¬ 
plies, therefore, every continuous variate can be only approximated 
m the real world. It can be measured only to a finite number of places 
because of the limitations of our measuring instruments. However 
as you will see, many times statistical theory makes the simplifying 
assumption of the continuity of quantitative observations This or¬ 
dinarily is an approximation which costs little in terms of accuracy 
A qualitative observation refers to some nonnumerical property 
or some attribute of an elementary unit. As the name implies, it is a 
qua lty The terms used to describe the unit’s characteristics are in 
fact, referred to as attributes. 

Example 2.18 Here are some illustrations of qualitative charac¬ 
teristics, a few of which we have already met ill other examples: 


Elementary Unit 

Characteristic 

Attributes 

An apartment. . . . 


Occupied or vacant 
Defective or not defective 
Coal, gas, or oil 

Head or tail 

Male or female 

Absent or present 

Very satisfied, satisfied, 
neutral, dissatisfied, or 
very dissatisfied 
Increasing, stable, or 
decreasing 

A fuse. 


A dwelling unit. 

v^UaJJ ly 

Flip of a coin.. . 

i t yp“ oi neaung 

An employee. . . 

Sex 

Attendance 

An employee.... 

An employee.... 

A firm. 

opinion 

Sales level 




In some problems, qualitative data may be regarded as numerical 
data, at least indirectly. For some populations we count the number 
of elementary units having a certain attribute. By enumerating the 
number of items we get a numerical result. Moreover, in such a 
coun we score 1 for an item that has the attribute and 0 for an item 
that does not have it, actually cumulating the “ones” as we proceed 

obsTrvV 0 86t ° Ur t ° ta i 1 ' Indlrectl y> we are assigning the qualitative 
observations numerical scores, these being 0 or 1. 

Now, a key point which flows from our discussion in this section is 
that a population may be a population of variates or a population of 

attributes. Before elaborating, suppose that we look at two illustra- 

tions. 

Example 2.19 A small town has 750 homes. A builder contem 
plating a new housing development, wishes to determine the types of 

2 r. ivr' r “ ly tai,t *» Mp ii - »*■* 
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In such a problem we have the following: 

Elementary unit.Home 

Characteristic.Type of heating 

Type of observation.Qualitative 

Attributes.Oil, gas, coal, other, or none 

The set of 750 attributes (one per home) is the population under con¬ 
sideration. If sampling were to be used, it would involve observing a 
sample of these attributes. 

Example 2.20 A trade association has 310 members—tool and die 
manufacturers. It is interested in canvassing its members to deter¬ 
mine the hourly wage rate for various job classifications, including 
male drill press operators, in member companies. This information is 
then sent to members to be used in wage negotiations. 

In such a problem we have, among other things, the following: 


Elementary unit.A member firm 

Characteristic.Hourly wage rate of male drill- 

press operators 

Type of observation.Quantitative 

Variates.$1.74, $1.65, $1.90, etc. 


The complete set of variates is the population for this problem. It 
is a population of quantitative observations, in contrast to the one in 
Example 2.19 which was a population of attributes; that is, of quali¬ 
tative observations. 

As we have noted earlier, a statistical population must be defined 
clearly and unambiguously in the planning stage of a statistical 
study. What is to be an elementary unit and which characteristic is of 
interest are very important considerations. In addition, so is the 
matter of whether the statistical population of study is one of vari¬ 
ates or one of attributes, and the matter of defining the attributes or 
the variates in clear-cut, objective, and meaningful terms. To rein¬ 
force these ideas in a specific way suppose that we consider two 
further illustrations. 

Example 2.21 Several different types of decision problems re¬ 
quire, for their solution, statistical studies of the ages of persons re¬ 
siding in a certain area, or of the ages of teachers throughout the 
country, or of the ages of employees in a certain company. For exam¬ 
ple, a firm’s decision on whether or not to set up a pension plan and 
even its decision on how to set it up necessitate some information on 
the age distribution of its employees. 

Thus, these statistical studies are all similar in that they involve 
the characteristic of age. This characteristic, however, must be more 
specifically defined. Is the characteristic of interest a person’s age at 
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birthda t v? bi lfl d h P y ’rh a t hi + f " thc T3 ng birthd ay, or at his nearest 
Birthday/ Is the characteristic of interest age in years age to the 

nearest month, or age to the nearest day? Answers to these questions 

beS 5",Mb.r7°" ‘ »>«> to whith the rite will 

JStStafcT Z:i" ”“* be P '" i " bd °» "« “7 «■« 

. Sl , nce age usuall y is expressed numerically, studies of age usuallv 
involve a population of variates. However,’ suppose the derision 
proMem only requires information on the number of persons of 
s hool age (defined as 5 to 16 years old). The study then involves 
a population of attributes since each elementary unit, a person mav 

sc e hooi a a a g C e” IZed 7 6lther ° f the attributes - “«*ool age” and "not 

Example 2.22 A merchandising company executive feels that 
through proper packaging the sales of one of his firm’s products may 

^in e Cr The d co A marke *“ B consultant suggests a new method of pack- 
gmg The company thus has a decision problem involving two al- 

And ° f actl0n: W continue to use the old method 

Shods^/tsam r W “ etThe feds the USe <>f both 

methods at the same time would be impractical. 

A statistical study of consumer preferences is proposed Thus a 

question arises as to what type of information, if any, would help the 

company executive decide between the alternatiV*? Sthrst 

n ° U -viV- S *?< de , ne a P°P uIatlon of attributes including only the 
possibilities “prefer new to old” and “prefer old to new ” 

hiiSelf W “Tf itnewT reSea f h H d ° ne ’ the executive should ask 
■no i * + * J knew the number of consumers who prefer our new 

package to the old and the number who prefer the old to the new 

vh^P V u® req i red decision?” A little thought should con¬ 

vince him that he could not. The population has not been defined 

triWp 7 n0r , c ° rre , ctl y- w °uid be preferable if the following at- 
tributes were defined as those to be included in the population of 

a) Would buy new but not old. 

b) Would buy old but not new. 

c) Would buy either old or new. 

d) Would not buy either. 

The complete set of these attributes for all elementary units—con¬ 
sumers (this too would have to be defined more carefuHy -wouW 
sons, f ar “ 0re reaSOnaWe P 0 P^ a tion of study for almost obvious rea- 

We should note briefly that it is customary to introduce some spe¬ 
cial terminology for observations in multiobservational problems 
called "!' observation is concerned with two characteristics, it is 

wfi! b ™ ar ; at l ohaer ™ ti ™ and the population of the observa¬ 
tions is called a bivariate population. In contrast, an observation 
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which refers only to one characteristic is termed univariate. For 
more than two different characteristics, the terms “multivariate ob¬ 
servation” and “multivariate population” are used. However, the 
terms “univariate,” “bivariate,” and “multivariate” should not mis¬ 
lead you. One or all of the characteristics in such a case may be at- 
tributes as well as variates. 

2.7 Frequency Distributions 

Thus far we have been discussing and thinking about statistical 
populations in terms of a complete group of observations—either 
variates or attributes. In a given population of study the variates or 
the attributes ordinarily will not be the same but will vary from one 
elementary unit to another. In addition, it is important to recognize 
that statistical studies are not concerned with individual observa¬ 
tions on elementary units. Statistical studies are impersonal to the 
extent that they are not interested in whether Jones, Smith, and/or 
Brown have station wagons—they are interested only in how many 
of those elementary units have them. In short, statistical studies are 
designed to provide information about populations—not about spe¬ 
cific individuals. 

Generally speaking, the most detailed summary of a complete 
set of observations with which we ever would be concerned m a sta¬ 
tistical study is a frequency distribution. In a broad sense the term 
“frequency distribution” may be applied to any forms of the data 
that arise when the occurrences of a particular attribute or of a suc¬ 
cession of variate values, in a population, are counted or enumerated. 
Thus, a frequency distribution shows all different variates or attri¬ 
butes’ in a population and their frequency of occurrence. 

For example, a census of the kinds of heating equipment in use 
would involve counting the number of homes that have each kind. 
The result would be a population of attributes described in terms of 
their distribution according to types of heating installations. 

Such distributions of attributes are very common. Leaf through 
the pages of a compilation such as the Statistical Abstract of the 
United States and you will find examples by scores and hundreds. 
Some of these counts are made according to geographic areas, or for 
periods of time; others, for example, are by classes of merchandise, 
by occupations of persons gainfully employed, by nationality, or by 
sex—as well as a great many more. 

Example 2.23 Many investors in stocks and bonds base their fi¬ 
nancial decisions on shifts and trends in the over-all market as well 
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as on information about individual securities. In this connection, one 
population of interest is the directions of change, for a given period of 
time, of stocks listed on the New York Stock Exchange. In this case 
an elementary unit is a stock traded on that exchange, and the 
population is one of attributes. The population of changes for July 1 
1958, may be shown in frequency distribution form as follows: 

Attribute Frequency 

Increase.~ 460 ~ 

No change. 269 

Decrease. 447 

Total.1,176 

It should be noted that this distribution presents a population for 
the data covered if, and only if, the data are to be used in an enumer- 
ative decision problem. If the study were analytical, dealing with the 
market process, the data would be sample data; they would repre¬ 
sent a sample of but one day. Note also that the stocks included 
are only those traded on the New York Stock Exchange. There E 
always the question of whether or not this is a population adequate 
for a problem dealing with stocks in general. 

^ A frequency distribution of a population of attributes may be 
pictured graphically in terms of a bar chart. The frequency distribu- 


Figure 2. ? 

DIRECTIONS OF CHANGE FOR PRICES OF 1,176 STOCKS 
NEW YORK STOCK EXCHANGE: JULY 1, 1958 

NUMBER OF CHANGES 



tion shown in the preceding illustration is presented in that form in 
Figure 2.1. Note the structural resemblance between the numerical 
presentation and the chart of the data. 

A somewhat different situation—a frequency distribution of a 
population of variates is suggested if we consider a study of hourly 
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wage rates in a given industry by a trade association. Suppose that 
the information is to be used by member firms in setting wage sched¬ 
ules. From the reports of the member firms the association might 
compile long lists of wage rates. However, a moment’s reflection will 
indicate that in that form the data scarcely will yield the information 
that the firms need. The interest is not in individual rates paid by 
individual firms but rather in the distribution of different rates in 
the industry as a whole. How much difference is there between the 
lowest rate paid and the highest? Do many of the rates tend to 
cluster around some norm? Is the tendency very marked, or only 
moderate? Is the norm at a high or a low level in the scale of rates? 

If the wage rate data are not arranged in some special order, or 
given a definite structure, the very mass of the numerical facts will 
tend to obscure whatever significant information they may contain. 
Useful answers to such questions as those above will be difficult to 
obtain. 

A simple but effective procedure may be employed to reveal the 
internal structure of the data. It is carried out as follows: (1) set up 
quantitative groupings of wage rates that will serve to break up the 
whole range of rates into a limited number of classes; and (2) count 
the number of rates that fall within the limits of each class. Typical 
results are illustrated in the next example which covers a situation 
analagous to the one we have been discussing. 

Example 2.24 The Monthly Labor Review , May, 1958, presents 
a study of average straight-time hourly earnings for production 
workers in woolen mills as of September, 1957. Data are given for 
the United States as a whole and for various geographic regions. To 
serve the purposes of this example only the distribution of workers in 
the New England region has been used, together with that for the 
United States. The figures shown below have been derived by com¬ 
putation from those that appear in the original source: 


Average Hourly Earnings 
(in Dollars) 

Number of Workers 

United States 

New England 

1.00 and under 1.20. 

6,537 

2,009 

1.20 “ “ 1.40. 

20,258 

9,128 

1.40 “ “ 1.60. 

15,135 

9,453 

1.60 “ “ 1.80. 

8,598 

4,815 

1.80 “ “ 2.00. 

4,711 

2,511 

2.00 “ “ 2.20. 

2,532 

1,329 

2.20 “ “ 2.40. 

648 

266 

2.40 or more. 

294 

59 

Total. 

58,713 

29,570 
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Note that as a result of dividing up all of the wage rates among 
the various classes we have a frequency distribution. In place of a 
large number of individual rates we have a compact description of 
a population of variates in terms of the frequency of their occur¬ 
rence and of the essential characteristics of their distribution. In ad¬ 
dition, although it ignores which elementary units have which char¬ 
acteristics, it summarizes the information of statistical interest- 
how many elementary units have certain characteristics. At a later 
stage of our study we shall give somewhat more attention to the tech¬ 
niques applicable to the construction of frequency distributions. 

A frequency distribution of a population of variates may be shown 
graphically in the form of a histogram (or column diagram). For ex¬ 
ample, the frequency distribution for the United States shown in the 

Figure 2.2 

DISTRIBUTION OF PRODUCTION WORKERS IN WOOLEN MILLS 
UNITED STATES: SEPTEMBER, 1957 
By Average Straight-Time Hourly Earnings 



preceding illustration is presented as a histogram in Figure 2.2. It 
should be noted that a direct relationship exists between the areas of 
the diagram and the frequency, or number, of occurrences. The area 
of the whole figure is proportionate to the total number of observa¬ 
tions, and the area of each column individually is in proportion to the 
frequency of the class it represents. Thus the histogram very effec¬ 
tively pictures the form and character of a population’s distribution. 

At times we may prefer to convert actual, or absolute, frequencies 
to a percentage form. This results in a relative frequency distribution 
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in which the frequency of each class is expressed as a ratio to the 
total number of cases. One reason for doing this arises in a situation 
in which we want to make a comparison between two population 
frequency distributions. In such a case, it may be misleading, or at 
least difficult, to compare actual frequencies because one population 
includes far more observations than the other. Relative frequencies 
may indicate more clearly the differences and/or similarities in the 
two populations of interest. 

In Example 2.24, if we attempt to compare the distribution for 
New England with the one for the United States generally, we find 
so much numerical difference in the respective frequencies for given 
classes of hourly earnings that the result is without any significance. 
We almost inevitably must ask ourselves such questions as: “Are 
8,598 workers (at the $1.60 to $1.80 level) a greater or a lesser pro¬ 
portion of all United States wool production workers than 4,815 are 
of those in New England?” In other words, we must think in relative 
rather than in absolute terms. The next example should help to 
make clear the greater effectiveness of percentage distributions in 
situations of this sort. 

Example 2.25 Following are relative frequency distributions of 
production workers in woolen mills by average straight-time hourly 
earnings. The basic data are the actual distributions of Example 
2.24. (Items do not add to exactly 100 per cent due to rounding.) 


Average Hourly Earnings 
(in Dollars) 

Per Cent of Total Workers 

United States 

New England 

1.00 and under 1.20. 

11.1 

6.8 

1.20 “ “ 1.40. 

34.4 

30.9 

1.40 “ “ 1.60. 

25.7 

32.0 

1.60 “ “ 1.80. 

14.6 

16.3 

1.80 “ “ 2.00. 

8.0 

8.5 

2.00 “ “ 2.20. 

4.3 

4.5 

2.20 “ “ 2.40. 

1.1 

.9 

.2 

2.40 or more. 

.5 

Total. 

100.0 

100.0 


In this section we have viewed the frequency distribution as a 
device mainly for summarizing a statistical population. It must be 
stressed again, however, that in most decision problems we cannot 
construct frequency distributions of populations—simply because 
complete information rarely is available for such a task. Neverthe¬ 
less, the frequency distribution is an important concept. Occasionally 
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a population is available and may be so summarized (as in this sec¬ 
tion’s illustrations). Also, you now should realize that in many prob¬ 
lems, even though only a sample of observations is to be obtained, 
thinking about the population is necessary in order to focus on the 
type of data required to make a decision. In addition, thinking about 
the population in terms of a frequency distribution points up in 
which form these data are needed in many problems. Then, too, 

frequency distributions are useful in other connections in statistics_- 

for example, to summarize sample data for further analysis. In our 
study of statistics we will meet several different situations in which 
we may use the concept of a frequency distribution. 

2.8 Population Models 

Frequency distributions of variate populations, as you may well 
imagine, can assume an almost endless variety of shapes and sizes. 
Nevertheless, it is possible to classify them into a few general types. 
For example, in Figures 2.3—2.8 we have sketched a series of histo¬ 
grams to illustrate some of these general types of frequency distribu¬ 
tions of populations which one meets in statistical practice. 

Accompanying each histogram is a smooth continuous curve which 
depicts the more salient features of the frequency distribution. We 
will refer to these smooth curves as population models. In using 
smooth curves such as those drawn, a statistician resembles a carica¬ 
turist who by a few bold strokes of his pen can so capture the essence 
of an individual that he is easily recognizable. Although such smooth 
distributions are never observed in practice, they present a simplified 
method of describing populations. The term “population model” is, 
in fact, suggestive. It connotes that the smooth curves are models 
rather than perfect replicas of populations. These models or curves 
can be represented by mathematical equations, and as we shall see 
later in the text, the use of such equations offers a distinct advantage 
in the development and application of statistical theory. 

We may turn now to a discussion of the illustrations of the popula¬ 
tions and their models provided by Figures 2.3 through 2.8. Fig¬ 
ure 2.3 presents the frequency distribution of the heights of soldiers 
inducted into the Army in 1943. This distribution is low at the two 
extremes but rises fairly steadily to a single peak which is near the 
center of the distribution. There are very few extremely short sol¬ 
diers, very few extremely tall ones, and a larger number of soldiers of 
moderate height. The most typical height falls between 67.5 and 
68.5 inches. The pattern of variation to the left and to the right of 
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this most typical height is almost symmetrical. In the diagram ac¬ 
companying the histogram we have drawn the smooth curve (popu¬ 
lation model) which describes these main features of the distribu¬ 
tion. 

The model, called the normal curve or the normal distribution , is 
of special interest to statisticians for many reasons, and we shall dis¬ 
cuss it in great detail in later chapters. It satisfactorily describes 
many populations for a large variety of physical characteristics and 

Figure 2.3 



HEIGHT (IN INCHES) 

for several measures of educational and mental attainment such as 
IQ tests. In addition, many quality characteristics—diameters, 
weights, lengths, etc.—of manufactured product also will be found, 
in many applications, to be normally distributed, or at least approxi¬ 
mately so. Several other statistical populations involving business 
and economic data also are approximated closely by the normal dis- 
tribution. 

Figure 2.4 involves a frequency distribution of the number of oc¬ 
cupants living in dwelling units in the United States. This distribu¬ 
tion has a single peak, as the last population did, but it is located at 

Figure 2.4 



NUMBER OF OCCUPANTS 
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the lower end of the distribution. In other words, the branches of 
the curve are not symmetrical about the peak. Such asymmetrical 
distributions are known as skewed distributions. Furthermore, since 
the longer tail of the distribution is toward the right, we say that it is 
skewed to the right or positively skewed. The accompanying popula¬ 
tion model which summarizes these pertinent facts is also commonly 
encountered when one deals with business and economic data. It 
might be used, for example, to depict populations of personal in¬ 
comes, corporate incomes, sales, commodity prices, assets of com¬ 
panies, etc. 


Figure 2.5 



COST OF SALES TO NET SALES RATIO 
(PER CENT) 



Figure 2.5 represents the distribution of the ratio of cost of sales 
to net sales for a segment of the plastics industy and the correspond- 
ing population model. Note that this type of population is also 
skewed, but since its longer tail is to the left, we say it is skewed to 
the left , or negatively skewed. Populations of variates which have an 
upper bound may, as a general rule, be approximated by this 
model. Thus, the upper bound of the ratio of cost of sales to sales is 
roughly 100 per cent because firms could not stay in business for any 
length of time if their cost of sales equaled or exceeded their sales. 

The distribution of the number of corporations filing corporate in- 


Figure 2.6 
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come tax returns classified by size of their assets is shown in Fig¬ 
ure 2.6 together with a population model representing it. Because of 
its shape such a model often is referred to as a (reversed or inverted) 
J-shaped distribution. This model also is descriptive of various types 
of “failure” data. For example, a population of times to failure of re¬ 
tail businesses would have the same general shape as the model in 
Figure 2.6. Most firms, if they fail, fail rather quickly. The longer 
a firm is in business, the less its chance of failing. Hence fewer firms 
fail after having been in business several years; still less fail after 
having existed for many years. 

Figure 2.7 



There are several other less common population models of some 
interest in statistics. For example, Figure 2.7 portrays a U-shaped 
frequency distribution of ages of viewers of a Jack Benny program 
and its appropriate population model. This type of distribution is far 
from common in practice, but obviously it does occur. 

Some distributions have only one peak and are called unimodal. 
On occasion we may encounter distributions with two (or more) 
prominent peaks—bimodal (or multimodal) populations. Thus Fig¬ 
ure 2.8 portrays the distribution of the ratio of the value of plant and 
equipment to net worth for a group of plastic molders in the United 
States, and a model which describes this series. The two peaks 
occur because the distribution combines ratios for small and large 
firms in the industry. While the ratios of the larger firms are con¬ 
centrated between 40 and 50 per cent, the ratios for smaller firms 
are concentrated between 70 and 80 per cent. 

Another example of a bimodal population would be a frequency 
distribution of the heights of males and females. Characteristics of 
the work produced on different shifts but presented in one distribu¬ 
tion might also assume this form. Thus, one reason for bimodal pop- 
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ulations is that they include two sets of somewhat different char¬ 
acteristics (e.g., male heights and female heights). 

The various population models we have been discussing are, of 
course, quite useful in statistical work. Why? First of all, they offer a 
simple method of describing a statistical population. Thus, if we say 
a population is very skewed to the right, a general picture of the pop¬ 
ulation emerges. Similarly, the use of the terms “symmetrical,” 

Figure 2.8 



(PER CENT) 


unimodal," a U-shaped," or “J-shaped” call to mind essential fea¬ 
tures of frequency distributions of populations. 

Second, a population model is very often an important factor in 
determining the pertinent fact or facts we have to know about the 
population in order to reach a decision on a course of action in a 
given problem. We will discuss a few uses of models in this connec¬ 
tion in Chapter 3. 

Third, we have already stated that a key problem in statistics is to 

keep the cost of obtaining observations within economic limits_ 

that is, as low as economically feasible. Population models are im¬ 
portant in this connection because the design of statistical studies is 
often simplified by some knowledge of a population's shape. We 
can see now that such information is often available on the basis of a 
priori reasoning. For example, even if we had never seen the results 
of a census of retail trade, we could deduce that the distribution of 
sales volume would be very skewed. It is common knowledge that 
there are many small stores with a relatively small sales volume but 
few very large retail outlets whose sales run into hundreds of millions 
of dollars. 
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In conclusion, it may be noted that although we have limited our 
discussion in this section to population models, many of these same 
smoothed curves are used to describe other types of frequency dis- 
tributions. 

2.9 You Should Now Know That 

The first step in the planning stage of a statistical study, or at 
least the first thing to do after identifying the problem, is to define 
the population. 

A population is the totality of all pertinent observations that 
might be made in a given problem. 

Defining the population is an objective and clear-cut way of stat¬ 
ing what type of information is needed to make a decision. 

Statistical studies involve observations of characteristics of ele¬ 
mentary units. 

The choice of an appropriate elementary unit and of the charac¬ 
teristic of interest are two important steps in defining a population. 

In many statistical problems a frame is necessary to identify the 
elementary units to be studied. A frame is a list or map or some other 
concrete way of identifying those units, and it provides a means of 
access to the population. 

A frame is as essential for a complete census as for a sample study. 

In practice, frames often are subject to various imperfections. 

Populations may be finite or infinite, depending on whether the 
statistical study is enumerative or analytical; this, in turn, depends 
on the purpose of the study. 

Some statistical problems are multiobservational since they deal 
with observations on more than one characteristic of an elementary 
unit. 

Observations may be quantitative or qualitative. If quantitative, 
the values the observations may assume are called variates; if 
qualitative, the properties observed are called attributes. 

A frequency distribution shows the different observations in a 
population and their frequency of occurrence. 

A population model of a variate distribution is a smoothed curve 
representing the salient features of the distribution. 

2.10 You Should Now Be Able to Solve 

1. Comment on the following statement: “The determination of 
the statistical population to be studied is not wholly a statistical 
problem.” 
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2. For each of the following proposed studies indicate (i) an ele¬ 
mentary unit, (ii) a characteristic of interest, and (in) whether ob¬ 
servations would be quantitative or qualitative. Explain each answer 
with reference to the specific problem. 

a) A college is planning a survey to determine the major fields of all 
matriculated students ; the information is to be used in deciding on 
future course offerings. 

b) A municipal housing authority is planning a study to determine the 
demand, among low-income families, for 1-, 2-, 3-, and 4-bedroom 
apartments—the study to provide a basis for planning its new con¬ 
struction. 

c) The U.S. Bureau of the Census is planning its Census of Manufac¬ 
tures which is to include data on the total number of production 
workers employed in manufacturing establishments. 

3. Discuss the suitability of the frame suggested in each of the fol¬ 
lowing situations: 

a) The telephone directory as a frame for a study by a city council¬ 
man to determine the attitudes of voters in a given city towards a 
proposed bond issue. 

b) Instructors’ class rolls as a frame for a study to determine the num¬ 
ber of absences from classes by freshmen, sophomores, juniors, and 
seniors at a university. The study is to be used as a basis for a de¬ 
cision on the university’s policy on the allowed number of absences. 

c) A list of past purchasers of EZ-shave electric razors (obtained from 
warranty registration cards returned to the company) as a frame for 
study on consumer preferences for a new model razor. 

4. A manufacturer of electric switches suddenly experiences a 
sharp increase in the number of defective wall switches returned by 
customers. He plans to check his manufacturing operations as a basis 
for deciding on whether or not to make any adjustments in the oper¬ 
ations. Is the proposed study enumerative or analytical? Explain. 

5. American Standard , Z1.3-1958 1 includes the following com¬ 
ments : 

The manufacturer’s inspection of his own product serves two purposes: 
Purpose A. To provide a basis for action with regard to the product al¬ 
ready at hand; as for instance, to decide whether the particular article 
or lot of product at hand should be allowed to go out, or some alternative 
disposition made (inspected further, sorted, repaired, reworked, scrapped, 
etc.). 

Purpose B. To provide a basis for action with regard to the production 
process, with a view to future product; as for instance, to decide whether 


1 American Standard, Control Chart Method of Controlling Quality During 
Production, Zl.3-1958 (New York: American Standards Association, Inc., 1958), p. 5. 
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the process should be left alone, or action taken to find and eliminate dis- 
turbing causes. 

Which of these situations presents a finite population of study? 
Which an infinite population of study? Explain. 

6. A magazine plans to conduct a survey to provide information 
about its readers that will aid advertisers in reaching decisions on the 
preparation of advertising copy. Why might such a study be multi- 
observational? 

7. A statistician once stated: “I have never yet seen an inspection 
problem which would not benefit from the point of view that the 
product to be inspected was a frequency distribution.” 2 What do you 
think prompted this comment? 

8. What type of population model would best represent the fol¬ 
lowing populations? Describe as symmetrical, or U-shaped, or 
skewed to the left, etc. Sketch the smooth curve you have selected, 
and, in each case, explain the reasons for your choice. 

a) The monthly rents of 3-room apartments in Chicago. 

b) The diameters of 2-inch dial knobs produced by a plastics firm. 

c) Scores on College Entrance Board examinations. 

d) Ages of unemployed persons 14 years old and over. 

e) The ages of readers of a science-fiction comic book. 

2.11 You Will Also Find That 

Many of the basic concepts regarding statistical populations, ele¬ 
mentary units, characteristics, and observations are discussed and 
illustrated in a number of statistical textbooks including the follow¬ 
ing: 

Sprowls, R. Clay. Elementary Statistics , pp. 8-10. New York: Mc¬ 
Graw-Hill Book Co., Inc., 1955. 

McCarthy, Phillip J. Introduction to Statistical Reasoning , pp. 8-20. 
New York: McGraw-Hill Book Co., Inc., 1957. 

For an extensive discussion of the use and the imperfections of 
frames in several different types of statistical studies, see: 

Yates, Frank. Sampling Methods for Censuses and Surveys, pp. 48-49, 
60-87. New York: Hafner Publishing Co., 1949. 

The terms “enumerative” and “analytical” are due to Professor 
W. Edwards Deming. For further illustrations see: 

2 G. D. Edwards, cited in Eugene L. Grant, Statistical Quality Control (2d ed.; 
New York: McGraw-Hill Book Co., Inc., 1952), p. 407. 
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CHAPTER • 3 


Decision-Parameters 


3.1 Introduction 

Chapter 1 dealt with some of the basic principles of decision mak¬ 
ing It also included the thought that some decision problems are 
statistical. It was pointed out that statistical decision problems are 
characterized by a decision maker’s uncertainty about so “ e ^ 
nent conditions surrounding his problem situation, that is, by 
uncertainty about the true state of the world. They also are char¬ 
acterized by the fact that observations or numerical information will 
help reduce this uncertainty. 

Chapter 2 focused on one step in statistically defining a statistical 
decision problem—on the question of how one goes about rigor¬ 
ously specifying the type of information needed m order to rcduce or 
eliminate that uncertainty. This step was called defining the statis- 

tlC A second step, the topic of this chapter, is called specifying the 
decision-parameter and presents a way of statistically describing 
the condition of one’s problem situation. As you will see, specifying 
the decision-parameter is intimately tied to the definition of a pop¬ 
ulation. In practice, they are not two steps but one 

First note, however, that even though a decision maker knew is 
whole population of interest, a wise decision would not flow from this 
mass of observations-for these data, in themselves wou d not tell 
him what he needs to know in comprehensible form. Usually, to ta e 
action he would like to know some summary property of the popu¬ 
lation’such as the population average or the population proportion 
(two things which we will study in detail below). For example, even 
if the population of your statistics quiz grades were known your 
final mark would be based on some average of these grades, bimilarly, 

50 
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even ^ ^ personnel adiriinistrator knew how each one of his employ¬ 
ees felt about a social activities program, he ordinarily would base 
his decision on the proportion of employees who favored (or who did 
not favor) the program. 

The average and the proportion in these situations represent 
ways of describing conditions pertinent to the decision problems. 
Hence, they represent ways of statistically describing states of the 
world. Such measures may be called decision-parameters. 

Thus, a decision-parameter is a property of a population which is 
needed as a basis for making a decision. It summarizes, in a single 
number, information regarding the population. It presents, for many 
decision problems, a method of describing states of the world. In this 
chapter we will study, in detail, several important decision-parame¬ 
ters, including those already noted. To further reinforce this general 
idea, however, let s first look at two illustrations of decision-parame¬ 
ters and their role in decision problems. 

Example 3.1 Every college receives many applications from can¬ 
didates for admission. A college thus faces a decision problem on 
whom it should admit. One population of interest to admissions 
directors involves scores on the College Entrance Board examina¬ 
tions. However, the whole population of scores is not necessary in or¬ 
der to decide who should be admitted; only certain properties of the 
population are needed. Often the property of the population needed is 
the 90th centile (or 90th percentile)—that score under which 90 per 
cent of the scores fall and over which 10 per cent of the scores fall. 
Thus, students with scores over the value of this decision-parameter 
are admitted; others rejected. 

Other schools, of course, may use the 75th centile or some other 
centile as their decision-parameter. Whatever their choice, however, 
note that it describes states of the world. If the centile is known, it 
provides a college with knowledge of the true state of the world and, 
hence, a basis for deciding who should be admitted. 

Example 3.2 A baby food products company has a mailing list 
of customers. The list is used to identify customers to whom cou¬ 
pons redeemable for samples or sales promotional literature are for¬ 
warded. Names, derived from many sources, are continuously added 
to the list. Because of this variety of sources, the number of dupli¬ 
cate, triplicate, and quadruplicate names builds up in the list, and 
from time to time the company sorts out these repeats. This, however, 
is a somewhat costly and tedious operation. 

At any time the company faces a decision problem: to sort or not 
to sort the list. The population consists of the names in the list. A 
frequency distribution of the population, then, might show the num¬ 
ber of names occurring once in the list, occurring twice, occurring 
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three times, etc. However, this would be more detailed information 
about the population than is needed to make the decision on 
whether or not to sort out the list. What is needed is the number of 
different names in the list. 

Thus, suppose they have a list of 250,000 (this is assumed to be 
known)! They would not need to know that 180,000 names appeared 
but once, 20,000 appeared twice (representing 40,000 entries on the 
list), and 10,000 appeared three times (representing 30,000 entries on 
the list). They would be interested only in the fact that 210,000 of 
the 250,000 names were unique, while 40,000 were elsewhere in their 

list. . . 

Thus, the total number of different names in the list is the correct 

way of describing the problem situation. It represents states of the 
world and is, hence, an example of a decision-parameter. 

Thus, as these two illustrations indicate, the nature of the problem 
determines the appropriate decision-parameter. Sometimes a total 
number is needed, sometimes a proportion, sometimes an average, 
and sometimes some other property of a population. Sometimes, of 
course, management might wish it knew two or more properties of 
the population before making a decision on a problem. For example, 
a manufacturer of light bulbs may wish to know the average quality 
and the variability of his product, in which case the problem involves 
two decision-parameters. 

Any statistical population usually has many properties—an aver¬ 
age, a minimum value, a maximum value, various proportions, etc. 
Every such property is called a parameter. Not every parameter is, 
however, a decision-parameter. One never need know every property 
(parameter) of his population in order to make a decision just cer¬ 
tain important properties, namely, those decision-parameters of 
interest. Thus, the key question in determining the appropriate de¬ 
cision-parameter (s) is always: “What do we have to know about 
the population in order to make a decision? 

In some situations the answer to this question will point up the 
fact that we need a frequency distribution of the population. It may 
be noted, however, that a frequency distribution is a set of decision- 
parameters since each frequency is a property of a population. This 
ordinarily is the most detail about a population one would need in 
order to reach a decision. 

Example S.S A haberdasher must decide on the proportion of his 
pajama inventory that should be in sizes A, B, C, D, and E. The 
pertinent fact for his decision problem is the relative frequency dis¬ 
tribution of his male customer’s pajama sizes. Thus, after having de- 
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cided on how much of an inventory to carry (this would depend on 
other information) he might, given the distribution of sizes, buy pa¬ 
jamas in proportion to his population’s relative frequencies. For ex¬ 
ample, if he were able to determine that 10 per cent of his customers 
wore size A, he might buy 10 per cent of his stock in that size. 

Knowledge of an average size or the most commonly requested size 
rarely would be helpful to a small haberdasher. He must know the 
entire distribution of sizes. 

Example 3,4 A company considers setting up a group life insur¬ 
ance program for employees. It contacts an insurance agent and re¬ 
quests information on the initial cost of such a program. This pre¬ 
sents the agent with an estimation problem. In order to determine the 
premium, he needs to know the frequency distribution of the ages of 
employees (or the distribution of birthdates). He could then apply 
rate tables to determine the premium. 

Note that knowledge of the average age of employees would not be 
sufficient information for the agent to decide on the premium. For ex¬ 
ample, suppose the agent knew the average age of 2 employees was 
50. The premium would be different if their ages were 25 and 75 
rather than 49 and 51. Life insurance rates are not proportional to 
ages. 

Thus, in some problems the population frequency distribution is 
the fact about a population needed by a decision maker. This, then, 
is the way we describe states of the world. Each state of the world is 
some frequency distribution; the true state of the world is the true, 
but often unknown, frequency distribution. 

Of course, the idea of a decision-parameter does not eliminate one 
basic problem—the fact that often complete information about a 
population is unavailable and, therefore, so are those pertinent facts 
called decision-parameters. Once again, however, it is necessary to 
realize that we must first determine what information we need— 
and do this before worrying about how we are to obtain it. Statistical 
methods and theory provide a basis for getting some information 
via sampling, but one must first know for what to sample and why he 
is sampling. Thus, specifying the decision-parameter is an important 
step to take in the planning of a statistical study; it should be taken 
coincidentally with the definition of the population. 

3.2 The Proportion 

. ma y surprise you to find that one of the most useful of all de¬ 
cision-parameters is also one of the most simple: the proportion . 

To refresh your recollection on the nature of proportions consider 
a housing study of occupied and unoccupied apartments. If the pop- 
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ulation were known, it might be found that of 1,250 apartments in a 
specified area, 14 were unoccupied. Hence, the proportion of unoc¬ 
cupied apartments would be 


14 

1,250 


.0112 = 1 . 12 % . 


Alternately, if 14 apartments are unoccupied, 1,236 must be oc¬ 
cupied, and the proportion of occupied apartments would be 


= .9888 = 98.88% . 

It will be convenient, in our study of the field of statistics, to adopt 
various symbols to represent different quantities or measures. You 
may think of these symbols as names or synonyms. Among the dif¬ 
ferent quantities we will want to name symbolically are. decision- 
parameters, and to keep our name calling reasonably consistent, we 
will adopt Greek letters for almost all decision-parameters. 

As a start, suppose we agree that the population proportion is to 
be known as *■ (the Greek letter pi). In any specific problem, tt is 
whatever proportion we define it to be. Thus, in the illustration 
noted two paragraphs ago, if we define ^ as the proportion of unoc¬ 
cupied apartments, v = .0112; if we define v as the proportion of oc¬ 
cupied apartments, ^ = .9888. 

Note that in this section we are studying the proportion as a prop¬ 
erty of a population. Moreover, we are studying the proportion as a 
decision-parameter—that is, as a property of the population which 
would serve, if known, as a basis for decision. Of course, in a given 
problem, tt is the decision-parameter of interest only if that property 
of the population is needed. 

Despite its simplicity, the proportion arises as the decision-param¬ 
eter in many problems; that is, it represents, in many situations, the 
key bit of information a decision maker would need in order to make 
a choice among alternative courses of action. 

Example 3.5 A company is considering the simplification of sev¬ 
eral types of jobs. Pertinent information on these jobs in defining 
new work loads is the time spent on supervisory, as opposed to non- 
supervisory, work. Exactly which time of the day (e.g., 9.01-9.22 
a.m., 9:30-9:52 a.m., etc.) is spent on supervisory work ordinarily is 
unimportant in this type of decision problem. Usually, the pertinent 
fact about the population of study is the proportion of time spent m 
supervisory work. Thus tt is, for such problems, a decision-parameter. 

Example 3.6 A firm has a policy of attempting to fill orders 
within a week’s time. In the past it has determined that only about 5 
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per cent of the orders do not meet the specified time limit, and this is 
a level management feels is satisfactory. Recently, however, a few 
complaints about delays in shipments have been received. In view of 
the complaints the firm wants to determine if its order-filling opera¬ 
tion has slowed down or has otherwise deteriorated. 

The decision problem is to retain the present system or to revise it. 
ihe appropriate decision-parameter is ■*: the proportion of delayed 
°fl er ® Pf 6 ® 61 ?* system allows - It is, as the problem has been form¬ 
ulated, the key bit of information the firm needs in order to choose 
between its alternatives. For example, management may answer the 
question of how the information is to be used by stating that a value 

svstem a Tb S “T i PCT T ^ requires damping of the present 
y m. This, of course, depends on present managerial policy. In anv 

case, is the important property of the population for the problem/ 


A m de ^ S1 ° n ' Pararaeter presents a way o f describing states of the 
world. Thus, m problems in which the proportion is the key decision- 
parameter, it represents a description of states of the world and the 
true value of » represents the true, but ordinarily unknown, state of 
the world. A state of the world, as you should recall, is a complete 
description of one’s problem situation. It is important because the 
consequences of any decision depend not only upon the decision one 
makes but also on which state of the world is true. In problems in 
which the proportion is the key decision-parameter, then, the con¬ 
sequences of a decision depend on the decision made and the value of 
as well. 


in SrSnfno A “anufacturer periodically purchases piston rings 
in lots of 2,000. As each lot is delivered to his plant, he must decide 
between two courses of action: (1) inspect the lot of product fully to 
weed out the defective parts, or (2) immediately send the full lot of 
parts on to his production line without any inspection. 

9ono SPe °l 10n C j StS ^ ^ b ° Ut 5?S per item > or $ 100 for a full lot of 

nftes P „Th T d + We Sh t aSS T 6 that 100 per cent ins Pection elimi- 
nates all defectives. Even if alternative (2), no inspection is 

end P nf tl ^ d f e ° tive ? arts win be discovered by a final test at’the 
end of the production line, and may be replaced at that time. How¬ 
ever, the cost unit of finding and changing an unsatisfactory part 
is an additional $2.00 per unit. y P 

know “ that 54 of . the P iston r ings in a particular lot are 
bv !i ^ th e Proportion of defective piston rings, would be 
. /2 '0<>o ~ ° 27 ; Jt follows that under these conditions, 100 per cent 
w ould cost $100; no inspection would cost $108 ($2 00 X 
54 defectives . Obviously, in this case, 100 per cent inspection is the 
more desirable alternative. 

hlJ E Io WeVe f ; Un< f ei ’ ° ther c ° ndit i ons no inspection may be the prefera¬ 
ble alternative. For example if w were equal to .01 (20 defectives out 
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of the 2,000), then the cost of using 100 per cent inspection would 
still be $100, but the cost of passing the lot uninspected would be 
only $40 ($2.00 X 20 defectives). 

A more complete schedule incorporating these results would appear 

as follows: 


7T 

Cost of 

100% Inspection 

Cost of 

No Inspection 

Preferable 

Alternative 

.010. 

$100 

$ 40 

No inspection 

020. 

100 

80 

No inspection 

025 

100 

100 

Either 

030. 

100 

120 

100% inspection 

040 . 

100 

160 

100% inspection 

050. 

100 

200 

100% inspection 

.100. 

100 

400 

100% inspection 


Graphically, these costs may be shown for various values of tt, as 
in Figure 3.1. By inspection of this figure it can be concluded that if 
tt for any lot were known, the manufacturer would use the following 
rule: If tt were less than .025, he would use no inspection; but if tv 
were greater than .025, he would use 100 per cent inspection. 


Figure 3.1 


COST FOR LOT 
(DOLLARS) 



The foregoing analysis in terms of costs has pointed up the fact 
that tt is the appropriate decision-parameter. Note, in particular, 
that the consequences of a decision depend not only on the alterna¬ 
tive selected but on the value of t r as well. Thus, if it were available, 
knowledge of i r would permit rational action. 
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Unfortunately, n rarely can be made available—to make it avail¬ 
able would require complete inspection, in which case the decision 
problem would vanish. When tt is unknown, can the decision be 
based on sample information? The answer is yes, though it must be 
qualified, as is explained in later chapters. 

The proportion occurs as a decision-parameter in both enumera- 
tive and analytical studies. This means that n may be the pertinent 
fact about a finite population or about an infinite population. Its use 
in the case of enumerative studies and, hence, with finite popula¬ 
tions is clear cut. However, the notion of a proportion in analytical 
studies and, thus, with infinite populations may trouble you. How 
can 7 r be computed in such a case? For example, we never can count 
all the defectives a production process might produce and divide this 
by the total number of parts it would turn out in the long run, 
thereby calculating the proportion of defectives. Such data are im¬ 
possible to obtain—hence, such a calculation cannot be made. 

However, consider the population of all possible outcomes as¬ 
sociated with flipping a coin. While we cannot count the total num¬ 
ber of heads or tails in such a population, we can imagine the relative 
frequency of heads, after a large number of trials, to be approxi¬ 
mately .50 (if the coin is unbiased), or perhaps .55 (if the coin is 
somewhat biased). Similarly, we can think of the proportion of de¬ 
fectives a machine produces in the long run as possibly equal to .03, 
or .012, or .054, the exact proportion depending on the efficiency of 
the operation. Intuitively, what we are considering is the mathe¬ 
matical notion of a limit—what proportion would occur in the long 
run. 

The whole matter of calculating the population proportion ex¬ 
actly is usually academic. It is impossible to calculate ir in analytical 
studies because complete information is impossible to obtain. In 
addition, ?r rarely can be calculated even if we are dealing with a 
finite population—in most of these cases complete information also 
is unavailable. In both types of situations, statistical methods and 
theory provide ways of estimating tt or of otherwise acting in the 
absence of complete information about the exact value of that de¬ 
cision-parameter. But, in the planning stage of a study it is still nec¬ 
essary to focus on the appropriate decision-parameter and to answer 
the question of how it is to be used. 

One other extension of the idea of a proportion as a decision- 
parameter involves the thought that in some problems not only one 
but several population proportions may represent the information 
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needed by a decision maker. For example, a marketing problem may 
require information on the proportion of consumers preferring 
brand A to all other brands, the proportion preferring brand B to all 
others, and the proportion preferring brand C to all others. Or, an 
economic survey of businessmen’s expectations about the trend of 
prices may involve obtaining information on the proportion of ex¬ 
pectations of an increase, the proportion of expectations of no 
change, and the proportion of expectations of a decrease. Such prob¬ 
lems simply present situations in which there are several decision- 
parameters. 

What is the aim of a statistical study in any problem of the type 
discussed in this section? Generally, it is the same aim as that of any 
statistical study—to provide a basis for decision. Specifically, it is to 
provide a basis for decision even in the absence of knowledge about 
the true value of ir —the true value of the decision-parameter which 
represents the true state of the world. This it may do by providing 
sample information, within economic limits, which reduces the un¬ 
certainty about the true value of the unknown population propor¬ 
tion, thereby permitting rational action even in the face of uncer¬ 
tainty. Of course, if the population proportion were known, or could 
be made known, an appropriate decision could be made without any 
uncertainty about 

3.3 The Aggregate 

In this section we shall turn to another decision-parameter which 
occurs in a number of statistical decision problems. This decision- 
parameter is based simply on the idea of a total or a sum of numbers. 
In the field of statistics, however, this measure generally is called the 
aggregate. 

Consider a population of variates; that is, a total set of numeri¬ 
cal observations that may arise in a given decision problem. As its 
name implies, the aggregate is, in such a case, the sum of all the vari¬ 
ates in the population. 

Thus, in this section we shall be studying the aggregate as a 
property of a population. An important question in this connection 
is: “In what types of problems does the aggregate represent key in¬ 
formation about a population?” That is, when is the aggregate the 
decision-parameter of interest in a problem? 

The general answer is that the aggregate is the key decision-pa¬ 
rameter when it adequately describes the states of the world in a de¬ 
cision problem—when the consequences of a decision depend on the 
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decision made and the value of the population aggregate as well. 
Let’s reinforce these general ideas with some specific illustrations: 


Example 3.8 An auditor of the books of a small textile firm must 
verify the company’s total of accounts receivable. The firm has 20 
such accounts. If the population of true balances for these accounts 
were known, they might be listed as follows: $1,225.04, $218.96 $.00 
$450.00, $1,100.50, $654.05, $450.00, $876.18, $.00, $.00, $98.00,' $W 
$768.13, $56.08, $1,106.22, $875.00, $.00, $45.55, $2,100.00, $67.05! 

This mass of data, in itself, is not what the auditor needs to make 
his verification decision. By the very definition of the problem, he 
needs to know the aggregate of this population. Again assuming com¬ 
plete data on the true balances are available, this aggregate is: 


$1,225.04 + $218.96 + $.00 + 


+ $67.05 = $10,090.761 


However, since such a total would not be available in the planning 
stage of a study, the auditor should ask himself at that time what he 
might do if he knew the value of this aggregate? For example, sup¬ 
pose you were the auditor and the firm listed accounts receivable as 
$10,090.22 on their books. If you were later to find, as above, that the 
aggregate was $10,090.76, what course of action would you take? 
Would you certify their figure or refuse to do so? What if the book 
balance was $12,897.22 and you were to find, after your study, that 
the true balance was $10,090.76? 


# In the last section we gave the name ?r to the population propor¬ 
tion. In the same spirit, we may also call the population aggregate 
A (simply the first capital letter of the English alphabet, but also, 
if you prefer, the Greek capital letter alpha.) Thus, in Example 3 8* 
A = $10,090.76. 


Example 3.9 The policy of a large retail store is to stock $15,000 
worth of costume jewelry for the Christmas season. Some stock, the 
value of which is unknown, is on hand for this year’s coming season. 
The present stock may be thought of as a population aggregate and 
symbolically represented by A. 

The decision problem is to decide how much, if anything, to pur¬ 
chase. The aggregate is, in this situation, the key decision-parameter. 
It describes the problem situation; hence, it describes states of the 
world. The consequences of any decision depend not only on the al¬ 
ternative selected but on the value of A as well. 

Thus, if the store determines the value of A, they obviously can 
make a purchasing decision. For example if A were known to equal 
$9,500, a supplementary purchase of $5,500 would bring the stock up 
to the required total value of $15,000. But if A were found to be $17,- 
450, the store already would be well stocked, and, in fact, over¬ 
stocked. No further purchases would be necessary. 
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In general, A serves as the decision-parameter m the following 
way: If A is less than $15,000, the store will buy {$15,000 - A) dol¬ 
lars worth of jewelry; while if A is equal to or greater than $15,000, 

they will make no further purchases. 

Note how this decision rule relates actions to observations before 
the observations are to be made. It implements the idea of a priori 
planning Also make note of the fact that a statistical study, if it 
were to be used in such a problem, would aim at estimating A on the 
hpsis nf sample information. 


The last two illustrations serve to indicate only two of the many 
decision problems in which a population aggregate is the key deci¬ 
sion-parameter. For problems in the field of accounting, information 
may be needed on total payroll, on the aggregate value of physical 
property or on the sum total of checking account balances. Various 
economic problems require information on the economy’s national 
income (the aggregate earnings of labor and property that arise from 
the current production of goods and services by the economy) or on 
the total amount of money in circulation or on total consumer credit. 
A purchasing agent may need to know the total backlog of unfilled 
orders in a supplier’s industry before deciding on his own buying pol- 
icy. A real estate corporation may need data on the aggregate 
amount of office space currently available in a city before making 
further construction plans. In all of these problems, and in many 
others as well, you can see that attention centers on some aggregate. 

The aggregate is, of course, a reasonable property only for finite 
populations. The aggregate for any infinite set of variates ordmari y 
would be infinite. Thus A may be a decision-parameter only m enu- 
merative studies, never in analytical studies. 

In any problem in which A is the decision-parameter and it is 
known or it can be made known easily, the problem vanishes—no 
uncertainty about A would exist and a decision could be made in the 
face of complete information. If A is unknown, however the problem 
may still be decided on the basis of a sample study; that is, partial 
information derived on the basis of statistical methods and theory 
could be used to estimate A or to otherwise provide a basis for deci- 


In addition to ■* and A it will be convenient, at this time, to define 
and utilize some other statistical symbolism which we shall be using 

occasionally in the course of our studies. . 

First of all, the total number of observations m a population will 
be denoted by N. Thus IV is a general symbol for the size of a popula¬ 
tion. Hence, in Example 3.8, since we were dealing with 20 variates, 

N = 20. 
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Secondly, each individual observation in a population will be de¬ 
noted by X However, in order to distinguish between observations 
we must add a subscript to each X. Thus, if a population is in some 
specified order, X, will refer to the first observation in the population, 
X 2 will refer to the second observation, X 3 to the third observation, 
?t c 'The last or Xth observation will be denoted by X v . For example 
m ilixample 3.8: ? 


X 1 = 1,225.04 , X 2 = 218.96 , X 3 = .00 , 


, X N = X 2 o = 67.05 . 


(The three dots, incidentally, mean “etc.”) 

In terms of this notation, you can note that a population may be 
represented, m a general way, by the set of observations (X^X^Xg, 
• • • , x). Now, as you know, a decision-parameter is some impor¬ 

tant property of a population. Alternately, we may say that a deci¬ 
sion-parameter is some function of the N possible observations 

, x N ). 


The aggregate is a particularly simple decision-parameter; that is 
it is a particularly simple function of the population (X t . X,, X 8 ,' 
• - • , X x ). It is merely the sum of these observations. Thus since 
we have named the aggregate A, we may define it symbolically as • 


^ d" ^"2 + X$ -f- . . . -f- Xn . 

One last refinement in this connection involves introducing an 
operational symbol for the operation of summing. Whenever we sum 
a series of numbers, we may symbolize that operation by 2 (the 
Greek capital letter sigma). 2 is similar to such operational symbols 
as + which tells us to add, X which tells us to multiply, and - 
which tells us to divide. 

To make use of the 2, let us denote some value in a population by 
X^—this refers to the ith observation in the population, where i is 
itself a general symbol which may take any value from 1 to N. If we 
want to express the operation of summing all of the X 4 values we 
may then write 2X 4 . This says “sum the values of X f ,” Which values 
are they? This depends on what we let i equal. For example, 


2X 4 

means X 1 + X 2 + X 3 . Similarly, 


(*' = b 2, 3) 


^ (*’ = 2, 3, 4, 5, 6) 

means X 2 + X 3 + X 4 + X 5 + X 6 . Furthermore, 

2Xi (* = 1, 2, ... , N) 

means sum everything; that is, it means X, + X 2 + . . . + X x . 
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In conclusion, by using this summation symbol we may define the 
aggregate in a compact formula, namely, 

A = 2Xi (i = 1,2, . . . , N) . 

This expresses the thought that the population aggregate is the sum 
of all the values of X, with i ranging from 1 to IV. 


3.4 The Arithmetic Mean 


Another well-known and often-used idea that of an average 
is the basis for a third decision-parameter which frequently occurs m 
statistical decision problems. This decision-parameter is called the 
arithmetic mean. 1 

Undoubtedly you are quite familiar with what most people call 
the average. For example, many students are quite adept at averag¬ 
ing their quiz grades—if one receives grades of 85,100, and 100, usu¬ 
ally he can calculate his average mark of 95 rather quickly, and even 
another with grades of 50, 50, and 60 can determine his average of 
53% in due time. Thus, the average is, by custom, the sum of a set 
of values divided by the number of those values. 

In statistics, however, the average most often is called the arith¬ 
metic mean. This name is preferred to the plainer term “average 
for various reasons, including the fact that there are many measures 
called “averages” in statistics. We shall, for example, meet two other 
“averages” in the next section. ^ 

The arithmetic mean is used in many ways in and out of the field 
of statistics. For the present, however, we are interested in only one 
of these uses—the applicability of the arithmetic mean as a decision- 
parameter in statistical decision problems. In this section, then, we 
shall study the arithmetic mean as a population measure and con¬ 
sider the type of problem in which the mean adequately describes 
states of the world; that is, in which it represents the key fact about 
a population which a decision maker would like to know. 

Example S.10 A food products manufacturer purchases large 
quantities of an oil-bearing seed. These seeds are crushed for their 
oil which is one of the products the company markets. The manu¬ 
facturer periodically faces a decision problem on whether or not to 
accept a given shipment of seeds. In each case the average or mean 
number of pounds of oil per bushel may be adopted as an appropri¬ 
ate decision-parameter. 


1 The term “average” is very old, dating back at least to the 
The term “mean” is very probably even older and was known 


fifteenth century, 
in the sixth cen¬ 


tury, B.C. 
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Suppose that for a particular shipment the cost is $2.40 per bushel. 
Further assume that the value of a pound of oil is $.30. If the average 
number of pounds of oil per bushel of seed were 10, the average value 
of a bushel would be $.30 times 10, or $3.00. Ignoring any other costs, 
it would be profitable for the firm to accept that shipment. ’ 

However, if the arithmetic mean of the amount of oil per bushel 
were only 7.5 pounds, the average value per bushel would be only 
$2.25 ($.30 X 7.5). The cost of $2.40 would indicate clearly that ac¬ 
ceptance of the shipment would be unprofitable. 

The consequences of any decision depend on the value of the mean. 
It therefore represents a method of statistically describing states of 
the world for the problem. 

However, the mean never could be made known exactly in this 
type of problem; at least not in time for the decision. Why? Never¬ 
theless a well-selected sample may provide an alternative decision 
procedure. 


Example 3.11 Among the many facts on which stock market in¬ 
vestors base their financial decisions are the so-called “stock market 
averages.” Not all of these averages are arithmetic means, but most 
of them are essentially of that form. Those that are may be inter¬ 
preted quite simply. 

A market average measures the average value of shares of stock in 
a well-defined portfolio of stocks. The definition of such a portfolio, 
although a difficult problem, is not, primarily, a statistical one. It is' 
in essence, an exercise in the definition of a population, and as such, 
the responsibility for determining the adequacy of any portfolio rests 
with an expert in finance. 

For example,, suppose a portfolio, deemed adequate for an inves¬ 
tor s purposes, is defined as 25 shares of American Telephone and 
Telegraph Company, 100 shares of International Shoe Corporation, 
and 50 shares of Ford Motor Company. On a given day, if the closing 
prices of these stocks are $175, $33, and $42, respectively, the total 
value of the portfolio would be 


$175(25) + $33(100) + $42(50) = $9,775 . 

Hence, the average value per share is $9,775 divided by the total 
number of shares (25 + 100 -t- 50 ~) 175, namely 


,775 


175 


$55.86 


to the nearest cent. 

Almost all stock market averages may be interpreted as the aver¬ 
age value of some portfolio of stocks. As such any one presents an ex¬ 
ample of the arithmetic mean as a decision-parameter. 


. E xam ple 3.12 The Revolutionary War brought with it a severe 
inflation, and the colonies’ paper money apparently depreciated quite 
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rapidly. To offset the effect of the price rise, Massachusetts passed a 
law which provided for an average of price relatives to be used in 
determining the payment of wages to American soldiers from that 
colony. 2 

This scheme was designed to compensate for the rapid depreciation 
of the colonies’ paper money during the war period by paying the 
Massachusetts soldiers in “depreciation notes.” These notes were re¬ 
deemable in amounts varying proportionately to the average of the 
relative prices of four commodities—corn, beef, wool, and sole 

leather. . . 

Thus, “base period” prices were set for these commodities as their 
cost at the end of 1776; namely, in shillings/pence, as: 

Beef (lb.). /3| Wool (lb.).2/0 

Corn (bu.).4/0 Leather (lb.).1/3 

Furthermore, “current” prices were collected monthly by a group 
of about one dozen prominent citizens, who had been appointed for 
that task. Thus, in June, 1777, the following prices (again in shill¬ 
ings/pence) were reported: 

Beef (lb.). /8 Wool (lb.).4/0 

Corn (bu.).6/0 Leather (lb.).1/3 

Based upon these data, price relatives were computed for each of 
these commodities. That is, the ratio of each current price to the cor¬ 
responding base period price was computed. Thus, for June, 1777, 
the price relative for corn was 6 divided by 4 (shillings), or 1.500, in 
fact, as reported by the prominent citizens, the 4 price relatives were 

Beef (lb.).2.284 Wool (lb.).2.000 

Corn (bu.).... 1.500 Leather (lb.).... 1.000 

and their average, the depreciation rate, 

2.284 + 1.500 + 2.000 + 1.000 ^ x m 
4 

(You might check their arithmetic!) 

This result indicates that prices rose 69 per cent between the end 
of 1776 (the base period) and June, 1777, at least as measured by 
this average. In any event, the average served as a basis for the deci¬ 
sion on how much to pay Massachusetts’ soldiers. For example, in 
June, 1777, a depreciation note issued at the end of 1776 was redeem¬ 
able at 169 per cent of its face value. 

Today economists call averages of price relatives index numbers. 
Roughly speaking, index numbers measure over-all changes in the 
level of prices (or in the level of some other economic concept). They 
often are used as decision-parameters today in much the same way 
as in our example from 180 years ago. Thus, the wages of over 4 mil¬ 
lion blue-collar workers in the United States are tied, by contract, to 

2 Willard C. Fisher, “The Tabular Standard in Massachusetts History,” The 
Quarterly Journal of Economics , Vol. XXVII (May, 1913), pp. 417-54. 
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the Consumer Price Index, an index number of prices (for about 300 
items) published by the U.S. Bureau of Labor Statistics. 

As noted earlier, some statistical problems involve not one but two 
or more decision-parameters. Thus, at times, the average alone is not 
sufficient information about a population unless supplemented by 
other criteria. 

Example 3.13 A foundry produces iron castings. It must decide, 
hourly, whether or not its manufacturing process is operating satis¬ 
factorily. If so, it takes no action; if not, it looks for trouble in the 
process, and if found, the trouble is corrected. An elementary unit is 
a casting; a characteristic of interest, its weight. The population is 
the totality of weights the foundry’s process could turn out if it op¬ 
erated indefinitely under substantially the same operating condi¬ 
tions. What are the decision-parameters? 

One is the arithmetic mean of the weights—this obviously is im¬ 
portant for if the castings are too light, on the average, the foundry 
is not giving customers enough for their money. However, the arith¬ 
metic mean, in itself, does not describe states of the world completely. 
Another important consideration in the problem is that if a casting 
is too heavy (e.g., over 22.5 pounds), it cannot be processed further 
by some customers. Hence, the proportion of castings over 22.5 
pounds is a second important decision-parameter. 

In short, the decision problem involves at least two decision-pa¬ 
rameters: the mean and a proportion. The arithmetic mean, though 
important, is not sufficient information in itself to reach a decision. 

The arithmetic mean of a population may be represented, sym¬ 
bolically, as (the Greek letter mu). Thus, ^ is defined, for a popula¬ 
tion of variates, as the sum of all the variates divided by the number 
of them. 

In many problems the arithmetic mean (p) may be used inter¬ 
changeably with the aggregate (A) as a basis for decision. As you 
may recall, N denotes the number of observations in a population, 
and hence the arithmetic mean is simply the aggregate divided by 
N; that is, the arithmetic mean and the aggregate are related in the 
following ways: 


v = and A = Nfx . 

In several problems, however, it is advantageous to deal in terms 
of the mean rather than the aggregate. For one thing, the proce¬ 
dure of taking an average reduces the aggregate to a per unit figure. 
Thus, even if we knew that the total annual payroll in one company 
is $1.5 million and in another it is $1.2 million, the figures might be 
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misleading until we knew the average wage per worker or per some 
other elementary unit. This, however, depends on the problem 
situation. 

For another, the aggregate has no meaning in analytical studies 
since, as noted, the sum of an infinite number of variates ordinarily 
is infinite. But an infinite population still may have an arithmetic 
mean. For example, to determine coffee bean requirements we might 
like to know the average (arithmetic mean) weight of coffee we are 
packaging in “pound” cans. Such a study is analytical, dealing as it 
does with a process—hence the population of net weights is infinite. 
It represents all possible weights of coffee the process would turn out 
if it continued to operate indefinitely under the same conditions. 

We may define the arithmetic mean symbolically in terms of the 
notation introduced in the last section; that is, as a function of the N 
possible observations in a population: (X t , X 2 , X 3 , . . . , X N ). 

Thus, 


ii = 


X x + x 2 + x 3 + 


+ x, 


N 


or, more compactly, by reintroducing the summation symbol 

V = b 2, . . . , X) . 


2X< 


Since SX f with i = 1, 2, . . . , N is the sum of all the variates in a 
population, fx is defined as it was earlier—the sum of all the variates 
divided by the number of them. 

The arithmetic mean has several interesting arithmetical prop¬ 
erties, two of which we may note briefly here. Proofs and/or further 
examples are left to your curiosity and discretion. 

1. Given a set of N numbers with a mean fx\ if another number less 
than fx is added to the set, the new mean is less than fx. For ex¬ 
ample, the mean of 2, 3, and 4 is 3; if we add 1 to our original 
set, the mean decreases to 2.5. Similarly, if a number greater 
than (x is added to a set of numbers, the mean increases. What 
happens if a number equal to the original mean is added to a set 
of numbers? 

2. The sum of the differences between each individual value and 
the arithmetic mean invariably equals 0. For example, the mean 
of the numbers 3, 4, and 8 is 5. The differences or “deviations” 
from the mean are 3 — 5 = —2, 4 — 5 = —1, and 8 — 5 = 4-3, 
and the sum of —2, —1, and +3 is 0. Symbolically this prop¬ 
erty may be expressed as 


S(Xi - fx) = 0 


(i = 1 , 2 , 


N) 
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3.5 The Median and the Mode 

It is a most amazing fact that so many business decision problems 
can be solved by information about a population proportion or an 
aggregate, or an arithmetic mean. These three decision-parameters 
are simple yet useful m a large variety of situations, and, hence, a 
good part of our introduction to statistics will center on them. 

At this point, however, we may make note of the fact that there 
exist many other decision-parameters which occasionally represent 
pertinent facts about a population of study and briefly reinforce 
this assertion with a discussion of two such measures. These two 
measures are called the median and the mode , 3 

Let’s consider the median first. The median of any given set of 
numbers may be described as the middle or central value in that set 
o numbers. Or it may be described as that value above which and 
below which half of the set of numbers lie. Thus, we may say, for the 
moment, that when any set of numbers is arranged in order of mag¬ 
nitude, the middle value is referred to as the median. Suppose we 
ook at a numerical illustration or two before trying to define the 
median more precisely. 

h0Urly wages of five ™rkers are $2.10, 
$2.08, $-.15, $2.00, and $2.12. To determine the median we must ar- 
range these five values in their order of magnitude—$2.00 $2.08. 
$2 10, $2.12, $2.15. Then the median is $2.10 since that is the middle 
value. Such a “middle value” always exists when we are dealing 
with an odd number of values. 

However, suppose we want to derive the median for an even num¬ 
ber of values. For example, the monthly rents for six apartments are 
$128 $193, $150, $187, $162, and $96. To find the median rent we ar¬ 
ray these six values $96, $128, $150, $162, $187, $193. What is the 
median ? 

It should be the middle value, but unfortunately there is no middle 
value; there are two, namely, $150 and $162. In any such case, ei- 
ther of the two middle values or any number between them may be 

,, ^ ari ;f earson ^57-1936) who introduced the term “mode” may fairlv hp poll^ri 
be 6 inSmf statistics. Any one- or two-sentence comment on Pearson here would 

ff\Z trlr« 
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denoted as the median. However, for convenience, statisticians usu¬ 
ally adopt the convention of defining the median as the mean of the 
two central values. Hence, under this convention, the mean of $150 
and $162, namely, $156, is the median of the above rents. 

A precise definition of the median of a set of numbers is as follows: 
The median is a value that is neither greater than more than half of 
the numbers nor less than more than half of the numbers. 

Thus, the median of an odd number of values is, by definition, 
always the middle value when the complete set of numbers is ar¬ 
rayed. The median of an even number of values is, by convention, 
the mean of the two middle values. 

In what types of problems is the median useful as a decision-pa¬ 
rameter? The answer to this question is standard—any problem in 
which it is a pertinent fact which a decision maker would like to 
know. However, to better recognize its ability to serve as a decision- 
parameter, it is helpful to know certain properties of the median. 

As was implied earlier, the median generally is regarded as an av¬ 
erage— n ot the average, which is the term laymen reserve for the 
arithmetic mean, but simply as one of several averages. Like the 
mean, the median summarizes a population in one single, “typical” 
number. It is natural, then, to compare the median to the arith¬ 
metic mean. 

Every population of variates has a mean and a median as well. In 
some populations the values of these two measures are equal; in 
others the mean is larger than the median; and in still others the 
median is larger than the mean. One reason for this rests with the 
fact that the mean is influenced by extreme values while the median 
is not. For example, suppose we have three families’ annual in¬ 
comes—$3,000, $4,000, and $38,000. The mean of these incomes is 
$15,000; the median is $4,000. The mean is larger than the median 
because the mean is more affected by the one large, extreme income 
of $38,000. 

Example 3.14 One point of conflict in labor-management wage 
negotiations is the appropriate decision-parameter with which to 
measure average wages and upon which to negotiate. One of manage¬ 
ment's chief interests is total costs, hence it focuses on the arithmetic 
mean—the mean wage multiplied by the number of employees equals 
total payroll. A union, however, usually is more interested in the me¬ 
dian wage since it is a better criterion to determine the welfare effect 
of a wage change on its members; 50 per cent make the median wage 
or more and 50 per cent make the median wage or less. 

This usually presents a conflict situation, for the mean (upon 
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which management focuses) is often considerably larger than the 
median (upon which the union focuses) because the mean is more af¬ 
fected by a few large, extreme wage rates for highly skilled workers. 


In general an answer to the question of whether the mean or the 
median of a population is the larger, or whether the two are equal 
depends on how the variates in the population are distributed To 
see this graphically, consider Figure 3.2 which shows three popula- 
i°n models - The first is symmetrical, and in any such population the 


Figure 3.2 


SYMMETRICAL DISTRIBUTION 



ARITHMETIC MEAN 
MEDIAN 


DISTRIBUTION SKEWED TO RIGHT 




mean and the median are equal in value. The second is skewed to the 
right, and m such a case, as shown, the mean is greater than the me¬ 
dian. Why? Because the very fact that the curve tails off to the 
right means that the population includes some high extreme values. 

hese tend, as they should, to inflate the mean, yet have little in- 

f~° nthe median * The third distribution shown is skewed to the 
left This implies some low extreme values, and hence the mean is 
smaller than the median. Figure 3.2 illustrates the fact that knowl¬ 
edge of the general shape of a population—that is, the population 
model—may be of some aid in determining the relationship of the 
mean and the median. 

Thus, the fact that the mean is affected by extreme values in a 
population while the median is not, points up a general principle to 
follow m choosing one measure or the other as a decision-parameter* 
lhe median is preferable to the mean if you do not want your deci- 
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sion to be affected by extreme values. Of course, for some problems 
the very fact that the arithmetic mean is affected by extreme 
values in relation to their magnitude and to their frequency of occur¬ 
rence makes it the proper decision-parameter. 

Another property of the median which is of some interest arises 
in connection with the absolute differences between the median and 
the variates in the population it represents. (An absolute difference 
is one in which the sign, plus or minus, is ignored.) The property of 
interest is that the sum of the absolute differences between the me¬ 
dian and the individual variates in a population is a minimum; that 
is, smaller than any sum of the differences between the variates and 
any other value. For example, consider an array of numbers. 9, 10, 
and 15. The median is 10. The absolute differences between the orig¬ 
inal numbers and the median are 1, 0, and 5, respectively, and this 
totals 6. This is a minimum; that is, the absolute differences of 9, 
10, and 15 from any value other than the median will add up to 
more than 6. (You might check this by measuring absolute devia¬ 
tions about a value other than the median.) 


Example S.15 A concessionaire has nine refreshment stands lo¬ 
cated at various points on the boardwalk of a popular beach. He 
wants to locate a central supply station at some point on the board¬ 
walk, and he wishes to locate the supply depot so that the amount of 
travel between the stands and the source of supply is minimized. 
Graphically, we may envision the stands and their distance in yards 
from stand A as follows: 


We have introduced a population of observations (distances) into 
the problem. Furthermore, the decision-parameter of interest is the 
median, for this point will minimize the total distance to all other 
points of interest. Thus, the median is the fifth observation, or 800, 

which represents stand E. > < 

The concessionaire might locate at this point. In doing so he will 
minimize the total distance from the supply station to the stands. 
This total distance is, incidentally, 

(800 - 0) + (800 - 250) + (800 - 500) + . . . + (1,000 - 800) = 

2,280 yards. 

Any other location which might be chosen as the supply depot 
would yield a larger total distance. 

However, it should be noted that the simple solution presented 
rests on the assumption that each stand would be supplied equally 
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often. If not, we would have to count each observation (distance) 
m our population in proportion to the frequency with which it re¬ 
quires supplies, and determine the median of that population 

For example, you can see intuitively that if A required supplies far 
more often than any of the other points, total distance would be min¬ 
imized not at E (which no longer would be the median) but at some 
point closer to A. Whatever that point, however, it would be the me¬ 
dian of the population of distances included in proportion to their 
occurrence. 


Now consider a decision-parameter called the mode. The mode of 
a population may be defined as the most frequently occurring variate 
(or attribute) in that population. Consider, as a simple illustration 
a set of five numbers: 2, 3, 3, 3, and 6. The mode is 3 because it oc¬ 
curs more frequently than any other number. 

Graphically, the mode has a simple interpretation. Consider Fig¬ 
ure 3.2. For each population model, the peak or the highest poin't 
which implies the most frequently occurring value, indicates the 
position of the mode. 

The mode, like any other property of a population, is an appro¬ 
priate decision-parameter in situations when it is a key bit of infor¬ 
mation which a decision maker would like to know. While such situa¬ 
tions do not present themselves too often, there are exceptions. 
Generally, these involve a decision problem in which the most com¬ 
mon variate (or attribute) in a population must be selected as a 
basis for action. 


Example 3.16 Stores that sell at a low markup depend on a large 
turnover to make a profit on their operations. They can accomplish 
this by carrying the most typical or modal size of merchandise. A 
fast turnover would be defeated by a policy of carrying many sizes 
Hence, they carry that size which meets the demand of the largest 
number of people. This is, by definition, the mode. Therefore, in or¬ 
der to decide on what merchandise to stock, the mode is the decision- 
parameter of interest. 

Note the difference in objectives between the stores who buy for 
the modal customer and others, such as the haberdasher in Example 
3.3, who buy for the whole distribution of customers. 

3.6 Variability in Populations 

As has been pointed out on one or two occasions, populations are 
characterized, almost invariably, by some variability —by some dif¬ 
ferences in the observations that make up a population. Your sta¬ 
tistics quiz grades vary from test to test. Wage rates differ from oc¬ 
cupation to occupation. Prices differ from product to product The 
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number of persons reading The New York Times varies from day to 
day. Even the diameters of ball bearings differ from one bearing to 
another—although a producer hopes that this variation is not great. 

In this section, and the next as well, we shall study this idea of 
variability in populations, how it is useful in statistical decision prob¬ 
lems, and the ways in which we can be more specific about the no¬ 
tion of variability itself. To start being more specific, suppose we 
consider a small population of five wage rates: $2.20, $2.60, $2.10 
$2.25, and $2.35. This simple population obviously is characterized 
by some variation—the wage rates differ. But how much variation is 
there in this set of numbers? This is one of the basic questions with 
which we must now deal. 

Unfortunately, it is a difficult question. How, to consider a sim- 
ilar question, do you measure the consistency, or lack of consistency, 
in your quiz grades? One student gets grades of 50, 55, 50, and 60. 
We say among other things, that he is relatively consistent. An- 
other receives quiz marks of 70, 100, 65, and 95. He is relatively in¬ 
consistent. Still another gets grades of 95, 95, 95, and 100. He is al¬ 
most perfectly consistent. But what do we mean by consistency? Is 
there any way of measuring the consistency of, or the variation in, a 
set of numbers (e.g., wage rates or quiz grades)? 

The answer to this question is: “Yes, there is a way of measuring 
variability ” In fact there are several ways, three of which we will 
consider in this section. An analogy with some of our previous work 
also may help in pointing up where we are headed. The arithmetic 
mean, the median, and the mode, which we have studied m the last 
two sections, present numerical measures of an “average” for a pop¬ 
ulation Now we are interested in a numerical measure of variation 
in a population. In the first case we are concerned with a single num¬ 
ber which represents an average value for a population. Analo¬ 
gously, in the second case we want to define a procedure for arriving 
at a single number which represents the variation in the population. 

Before we attempt to define any such measure of variation, it is 
reasonable to ask why we should bother doing so. There are several 
reasons, one of which we will consider now. This involves the fact 
that the consequences of decisions in some problems depend on the 
variation in a population of study. In such cases we need an appro¬ 
priate measure of the variation to describe states of the world, buch 
a measure, if known, would be the pertinent fact a decision maker 
needs in order to choose among his alternatives. Thus, m short, one 
reason we want a measure of variation for populations is so that it 
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will serve as a decision-parameter. Let’s look at some illustrations 
in which variability is an important criterion for making a decision. 

Example 3.17 A common decision problem in production control 

occurs m choosing between two (or more) ways of manufacturing the 
same item; that is, in selecting that process which is capable of pro¬ 
ducing the better (or best) results. What does better (or best) mean? 

It may have several interpretations. One important one, which is 
pertinent to. our present discussion, flows from the fact that one of 
the main objectives of every manufacturing company is to produce a 
relatively uniform product. 

Thus, with respect to this matter of uniformity, or alternately that 
of variation, one process is better than another if it produces a prod¬ 
uct which is more uniform, or less variable. Consequently, one deci¬ 
sion-parameter of interest is some measure of the variation in the 
populations associated with the processes on which the decision is to 
be made. 

EmmpU 8.18 Jones and Smith both invest in the stock market. 
They have, however, different.objectives. Jones' goals are typical of 
a large class of people who aim for a steady dividend income and 
who want to take little risk. Jones and similar investors are inter¬ 
ested in stocks whose dividends promise little variation. Smith's 
goals are different. He is a speculator and is interested in those 
stocks whose prices promise considerable variation. Thus, variation, 
or the lack of it, is one important criterion for Smith or Jones in de¬ 
ciding whether or not to purchase a given security. 


Example 8.19 Until recent years, ladies’ coats of a given size were 
all manufactured and sold with uniform lengths. Firms that manu¬ 
factured coats used only some average length as their decision-pa¬ 
rameter. Short girls, upon buying a coat, had to cut off material or 
otherwise shorten it. Tall girls had to let down a hem to lengthen 
their garment. In addition, manufacturers had to use enough mate¬ 
rial on each garment to meet the length requirements of tall girls. 

Recently, however, the industry has begun to take variability into 
consideration in the production of ladies' coats. Every coat size is 
now made in three, lengths: short, regular, and long. For the manu¬ 
facturer, using variability as a criterion for decision means reduced 
costs on short and regular garments; while for many consumers it 
means a better fitting garment. 

Thus, we have sufficient motivation for defining a numerical meas¬ 
ure of variation. But the problem of exactly how to define such a 
measure is still a difficult one. Suppose we begin to overcome this 
difficulty with a simple, though not altogether satisfactory, solu¬ 
tion. 
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The Range. Such a solution is provided by a measure called the 
range. The range of a population is defined as the maximum value 
less the minimum value. Thus, for a population of wage rates, such 
as $2.20, $2.60, $2.10, $2.25, $2.35, the range is $.50 (the maximum 
value, $2.60, less the minimum $2.10). 

, Unfortunately, despite its simplicity, the range is not too use¬ 
ful as a decision-parameter. Its value depends only on the two ex¬ 
treme variates in a population; it in no way expresses the variation 
of the other variates lying between these two extremes. Ordinarily 
the consequences in a decision problem depend on some over-all 
measure of variability rather than on the range alone. Thus, the 
range is not particularly useful as a decision-parameter. It does have, 
however, other uses in statistics. 

The Average Deviation. Since the range has limitations as a 
measure of variability in a population, suppose we consider another 
possible approach. Note first that if all observations in a population 
were identical, there would be no variability. Furthermore, in such 
a case there would be no difference between any one value and the 
arithmetic mean of the observations since the mean would be identi¬ 
cal with each and every one of the individual observations. How¬ 
ever, the more variable a population, the greater would be the dif¬ 
ferences between the mean and the individual observations. 

Suppose, then, we focus on these differences or deviations of the 
individual observations from their arithmetic mean. Specifically, 
let’s once again consider our small population of wage rates: $2.20, 
$2.60, $2.10, $2.25, and $2.35. The arithmetic mean of this popula¬ 
tion is easily determined as follows: 

$2.20 + $2.60 + $2.10 + $2.25 + $2.35 _ $11.50 _ f9 

M = 5 5 ‘ 

Therefore, the deviations on which we are interested in centering 
our attention are as follows: 

$2.20 - $2.30 = - $.10 
$2.60 - $2.30 = + $.30 
$2.10 - $2.30 = - $.20 
$2.25 - $2.30 = - $.05 
$2.35 - $2.30 = + $.05 

We will consider two measures of variability which are based on 
deviations such as these. 

First, however, note that it would be futile to attempt to define 
a measure of variation as the average of these deviations because 
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the sum of the deviations about the arithmetic mean is always equal 
to zero. This is true in our simple numerical example since 

- $.10 + $.30 - $.20 - $.05 + $.05 = 0 , 

and it is true for any and every set of numbers, as was pointed out in 
Section 3.4. 

The fact that some deviations are positive and some are negative 
causes our trouble, for the positive total is canceled out by the total 
of the negatives. Suppose, however, that we ignore the signs of the 
deviations and then take their arithmetic mean. That is, suppose we 
define a measure of variation as the average of the absolute devia¬ 
tions of observations from their arithmetic mean. For example, the 
average deviation (or mean deviation) for our wage rate data is: 

$.10 + $.30 + $.20 + $.05 + $.05 $.70 „ 

5 —- -j- - $.14 . 

The average deviation is, obviously, a reasonable measure of 
variability and rather simple to interpret as well. It could, in many 
problems, be adopted as a decision-pafameter. However, this meas¬ 
ure is difficult to deal with mathematically because of the “unnatu- 
ral” algebraic procedure of ignoring the signs of the deviations. 
Furthermore, it is of little use in sampling theory. 

The Standard Deviation. Because of this fact and also because of 
historical reasons, statisticians often employ another measure of 
variation, also based on deviations about the arithmetic mean. 

The difficulty in basing a measure of variation on the deviations 
about the mean lies with the fact that the negative deviations cancel 
out the positive deviations. The average deviation makes use of one 
remedy—ignoring the signs of the deviations. Another solution to 
the dilemma is to base a measure of variation on the squares of the 
deviations. Once again the negative signs on the deviations will be 
eliminated. 

Thus, in the simple numerical situation we have been discussing 
throughout this section, the population of five wage rates is: 

$2.20, $2.60, $2.10, $2.25, $2.35 . 

In view of the fact the mean of these rates is $2.30, the deviations 
about the mean are: 

- .10, + .30, - .20, - .05, -f .05, 
and if we square these deviations, we find 

.0100, .0900, .0400, .0025, .0025 . 
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The average of squared deviations such as these is called the 
variance. Thus, for our numerical example, the variance equals 

.0100 + .0900 + .0400 + . 0025 + .0025 _ , 14o0 _ ^2f)0 
“ 5 5 

One other step usually is convenient in dealing with a measure of 
variation. Because we averaged squared deviations, our answer is, so 
to speak, in the wrong unit of measurement (e.g., dollars squared!). 
Therefore, it is often easier to deal in terms of the square root of the 
variance. This is called the standard deviation. For our numerical 
example, the variance is .0290; hence, the standard deviation is. 

V.0290 = $.17 . 

Throughout our studies of statistics we will have several uses for 
the variance and the standard deviation. 5 These uses include their 
applicability as decision-parameters and, more generally, as meas¬ 
ures of variability in a population. We will prefer these measures to 
the range and the average deviation for reasons which include those 
already noted. Remember, thfen, that the variance is the average of 
squared deviations from the arithmetic mean, while the standard 
deviation simply represents the square root of the variance. Some¬ 
times it will be easier to deal in terms of the variance; other times 
to deal in terms of the standard deviation. 

Because we will be referring to them frequently, it is convenient to 
give the variance and the standard deviation symbolic names. Thus, 
the standard deviation of a population will be called <r (the Greek 
lower-case letter sigma ) and the variance ( sigma squared). Thus, 
for our wage rate illustration 

o- 2 = .0290 and a = V.0290 = $.17 . 

The definition of the variance may be put precisely in terms of the 
symbolism we have introduced throughout this chapter. Given a 
population of N observations, (X 1} X 2 , • • • > %n)> we consider the 
differences between each of these values and their arithmetic 
mean, ^: 

(Xi- n, X 2 -Ht . • • , X N - m) • 

4 You will find Appendix A of some use whenever you need to compute the 
standard deviation and/or the variance. It includes a table of squares and square 
roots, and instruction on how to use it. , 

® The term “standard deviation” was introduced by Karl Pearson in 1893. The 
term “variance” was first used in 1918 by another renowned statistician, >r 
Ronald Fisher. 
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The variance is based on the squares of each of these deviations 
from the mean; that is, on (X*-/*) 2 , (X 2 - /*) 2 , . . . , (X* -,*) 2 . 
Specifically, the variance is defined as 


.2 _ (^1 “ m) 2 + (X 2 - nY + . . . + (X A r - uY 

N 

or, making use of the summation sign, 2, we may write 


__ 2(X< - m )2 
N 


(f = 1, 2, . . . , X) . 


The standard deviation may be expressed in a similar way. Since 
it is simply the square root of the variance, 


a — , w 1 ~ w ~ r • . . t 

\ N 


(Xj p) 2 -f- (X 2 — m) 2 ~~t~ • . . ~h (Xiy — nY 


or, equivalently 


“■ = M)2 (*' “ 1- 2, • • • , AO . 

How do you interpret the variance and the standard deviation? 
Since this question arises so often, perhaps we had better devote the 
next section completely to answering it. 


3.7 The Standard Deviation and the Normal Curve 

You should recognize that the variance (as well as its square root, 
the standard deviation) holds a position of considerable importance 
in the development of statistical methods. Although to some extent 
the choice of that particular measure of variability seemingly is arbi¬ 
trary, the choice is well founded in theory. In any event, the selection 
of a particular measure is of secondary importance as compared to its 
proper utilization and interpretation. Suppose, then, in this section 
we attempt more precisely to interpret M (and <r). 

The first thing for you to remember, generally speaking, is that 
o- 2 , or measures in one over-all number the variability in a popula¬ 
tion. It is a summary indicator of that particular population char¬ 
acteristic. The greater the variability in the population, the larger is 
the value of o - 2 or <r. Secondly, you must note that most measures of 
variability, including the variance, represent a measure of the extent 
to which there are differences between individual observations and 
some central or average value. This is precisely what variation means 
-—departure from a norm. In the case of as we have noted, differ¬ 
ences from the arithmetic mean of a population are involved in its 
definition. 
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To interpret a given value of <r more precisely, one ordinarily must 
think in terms of some population model. Thus, in many problems, if 
we assume a certain population model, we may deduce the relative 
frequencies of observations that fall within one, or two, or any given 
number of standard deviations from the arithmetic mean of that 
population. In order to illustrate this point in a concrete way, we 
may reconsider a population model briefly touched on earlier; the 
normal curve. 

The normal curve apparently was first discovered by a French 
mathematician, Abraham De Moivre, 6 in 1733, in connection with 
his studies of games of chance. Over the next 50 years his discovery 
went unnoticed. Meanwhile several other mathematicians inde¬ 
pendently derived the normal curve. One of these rediscoverers was 
the German mathematician, Karl Gauss. 7 In the subsequent devel¬ 
opment of mathematical statistics, De Moivre’s work was forgot¬ 
ten, and the theory of the normal distribution stemmed from the 
work done by Gauss. For this reason the normal curve is sometimes 
referred to as the Gaussian distribution. 

The work of Gauss and his contemporaries led to a wide applica¬ 
bility being attributed to the normal curve. Thus, in the early days 
of its development many people thought that most statistical popu¬ 
lations could be approximated by this population model. For that 
reason they attached great importance to the normal curve. Today, 
however, with considerably more experience behind them, statisti¬ 
cians are reluctant to make the generalization that most popula¬ 
tions can be assumed normal. But, even so, many populations can 
be assumed normal, and, furthermore, the normal curve has several 
other uses in the field of statistics. Thus, the normal curve holds a 
central place in statistical theory. 

The normal curve may be expressed in terms of a mathematical 
formula. For our purposes, however, we can gain an understanding 
of this curve without resorting to its explicit formula. An analogy 
with the straight line is rather helpful. As you undoubtedly know, a 
straight line is determined by two points. A different pair of points 

6 Abraham De Moivre (1667-1754) lived in England most of his life tutoring in 
mathematics and solving mathematical puzzles and problems, for wealthy patrons. 
Among the questions for whose answers such patrons were willing to pay were those 
dealing with games of chance, i.e., cards, roulette, dice, etc. In this connection, De 
Moivre became one of the originators and developers of mathematical probability. 

7 Karl Frederick Gauss (1777-1855), a famous German mathematician, con¬ 
tributed to almost every known branch of mathematics. His interest in the normal 
curve and in fact his derivation of the curve and its various properties were stimu¬ 
lated by his work in the field of astronomy. 



Oh. 3] 


DECISION-PARAMETERS 


79 


generally yields a different straight line. Similarly, a normal curve 
is determined by two characteristics—its arithmetic mean and its 
standard deviation. Thus, if we pick an arithmetic mean and a 
standard deviation, we determine one normal curve; pick another 
/* and a and we determine another normal curve. Generally, each 
different pair of these characteristics yields a unique normal curve. 
For example, Figure 3.3 shows us three different normal curves. 

Figure 3.3 



Notice that the centers of these normal curves all lie at different 
locations on their respective scales—their positions are governed by 
the values of a*.. In addition, some of the curves are fatter (more 
variable) or skinnier (less variable) than others—this is governed 
by the value of the standard deviation because a- measures varia¬ 
bility. As we said, then, a normal curve is determined by these two 
characteristics. 

Ordinarily, a normal curve looks like a bell. At least, it will al¬ 
ways look like a bell if you choose the scales on your graph appro¬ 
priately or, alternatively, if you use your imagination vividly. In 
any event, it is from this fact that the alternative name ’“bell¬ 
shaped” curve obtains. 

Nevertheless, every normal curve, whether it appears short and 
wide or tall and thin in a chart, has certain basic properties. For 
one thing, a normal curve is always symmetrical about its arithme¬ 
tic mean. This means that the right half of the curve is a mirror 
image of the left half, and vice versa. Another interesting fact is 
that the arithmetic mean lies at the point at which the curve hits its 
maximum height. These remarks reflect the fact that the arithme¬ 
tic mean, the median, and the mode of a normal population model 
are all equal. Why? 
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It also should be pointed out that the two “tails” of the normal 
curve trail off indefinitely in both directions. Thus, the range of a 
normal population is infinite. That is, both sides of the curve get 
closer and closer to the horizontal axis as we go farther and farther 
away from the center, but they never quite touch. This is a signifi¬ 
cant idea, and you should try to visualize it. Unfortunately, graphs 
cannot do justice to this point because the tails come so close to the 
horizontal axis that they appear to touch it. 

In addition, it is very pertinent to any interpretation of <7 for the 
normal curve to note that about 68 per cent of the area of every nor¬ 
mal curve is included within the interval: ( 1 ) the arithmetic mean 
less one standard deviation, and ( 2 ) the arithmetic mean plus one 
standard deviation. 

Thus, suppose we have a population that can be represented by the 
normal model. Approximately 68 per cent of the observations would 


Figure 3.4 





have values between (/* — <r) and + <*■). Of course, the actual mag¬ 
nitude of this interval depends on the values of /* and <7 and, hence, on 
the particular population under consideration. Thus, of those normal 
curves in Figure 3.3, 68 per cent of the observations in population 
(a) are between 41 and 49 since = 45 and <7 = 4; 68 per cent of the 
observations in population (b) are between 44 and 56 since ^ — 50 
and a = 6 ; and 68 per cent of the observations in population (c) are 
between 78 and 82 since /* = 80 and <7 = 2. 

Other relative frequencies besides the 68 per cent may be de¬ 
duced for the normal curve. Thus, it can be shown that approxi¬ 
mately 95 per cent of the observations lie within the interval repre¬ 
sented by the mean plus and minus two standard deviations. Almost 
all (about 99.7 per cent) fall within the interval represented by the 
mean plus and minus three standard deviations. These relative fre¬ 
quencies hold for every normal curve. The ideas are shown graphi¬ 
cally in Figure 3.4. 
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Example 3.20 A company manufactures floor tiles. Experience 
has shown that the thicknesses of the tiles that its production proc¬ 
ess turns out may be assumed to be normally distributed. Suppose it 
were determined that the mean of this population is .25 inches and 
that the standard deviation is .005 inches. It would follow that ap¬ 
proximately 68 per cent of the floor tiles the process would produce, 
if it continued to operate under substantially the same conditions' 
would measure between .245 (.25 - .005) and .255 (.25 + .005)’ 
inches. Similarly, approximately 95 per cent of the tiles would have 
a thickness of between .24 and .26 inches, and about 99.7 per cent 
would be between .235 and .265 inches. 

It will help you if you will recall that in any graphic representa¬ 
tion of a frequency distribution, relative frequency is proportional 
to the area under the curve. Thus, the results noted above and those 
shown in Figure 3.4 are derived by measuring areas under the nor¬ 
mal curve. These areas or relative frequencies are determined by 
advanced mathematical techniques. Fortunately, simple tables of 
the results are available, and such a table is given, together with di¬ 
rections for use, in Appendix B. We frequently will refer to the idea 
of relative frequencies or areas under the normal curve and will 
make extensive use of Appendix B. 

For the present, however, note that the interpretation of <x may be 
facilitated by reference to a population model. If our population may 
be represented (approximately) by the normal curve, we know (ap¬ 
proximately) 68 per cent of the observations in the population have 
values between (/» — <j) and (/*. + <r), etc. 

Of course, not every population may be represented by the normal 
model. Thus, this interpretation of it, in terms of the relative fre¬ 
quency of the observations included in a given interval, depends on 
the particular model that is being utilized. Figure 3.5 points this up 
by a comparison of the normal curve with two other smoothed fre¬ 
quency distributions. They are called the exponential population 
model and the rectangular population model. The exponential model, 
being skewed, presents different problems of interpretation. For ex¬ 
ample, approximately 63 per cent of the observations in such a pop¬ 
ulation fall in the interval one standard deviation to the left of the 
mean, while only about 23 per cent fall in the same interval to the 
right of the mean. But note in particular how the relative frequencies 
between, say, the mean plus or minus one standard deviation differ 
from model to model. In the case of the normal curve 68 per cent of 
the observations are included in that interval; in the case of the 
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exponential the relative frequency is 86 per cent; and for the rec¬ 
tangular model it is only 58 per cent. 

One last point in this connection. Suppose no population model is 
being used because none is applicable to the problem. Can we then 
make any statement about the relative frequencies that will fall 


Figure 3.5 

a. NORMAL DISTRIBUTION 



b, EXPONENTIAL DISTRIBUTION 



within the limits set by the mean plus and minus a given number of 
standard deviations? No exact answer can be given to this question. 
However, on the basis of statistical theory, we can say that the rela¬ 
tive frequency of the observations in a population that will deviate 
from the mean by more than k standard deviations is equal to or less 
than 1 /k\ For example, take k = 2. It follows that no more than y 4 
of the observations in a population will be outside of the interval 
0 - 2<r) to (p. + 2a). At least 75 per cent of the observations will be 
within that range. Similarly with k = 3 no more than % of the ob- 
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servations win fall outside of („ ± 3.); at least 88.9 per cent must 
be within that range. 

The general statement given above, known as Tchebycheif’s 8 in 
equality is an example of what statisticians call nonparametric sta¬ 
tistics. Roughly speaking, any theoretical result which does not 

inge on the assumption of a population model falls within this 
branch of our subject. 

3.8 Selecting the Decision-Parameter 

In the proceeding sections we have defined, studied, and illus¬ 
trated several decision-parameters which occur rather frequently in 
statistical decision problems. The choice of some criterion such as 
one of these is a basic step m the planning stage of a statistical study 

l ocuses attention on what a decision maker wants to know 
about the population he has defined. 

Judgment plays an important role in specifying the decision- 
parameter. As noted earlier the basic question that must be an¬ 
swered, of course, is: “What do we have to know about the popula¬ 
tion m order to make a decision?” Often, this is a simple question 
that can be answered readily. At times, however, it is a surprisingly 
difficult one and requires considerable deliberation. In any event 
the choice of a decision-parameter calls for the combined judgments 
of management and the statistician. 

This, however, is not to say that these judgments should be made 
arbitrary or too subjectively. The choice of a decision-parameter 
should follow certain patterns of logical thinking. Thus, whatever 
the choice, the decision-parameter should adequately describe 
states of the world. It should be such that the consequences of a de¬ 
cision depend, at least in part, on the value of that decision-param¬ 
eter. Only m these ways will knowledge of that decision-parameter 
provide a basis for rational action. Judgment plays an important 
role only m determining what is adequate, what may be the conse¬ 
quences and how knowledge of the value of the decision-parameter 
would be useful in deciding on a course of action. In short the 
choice of the decision-parameter depends on the particular circum- 
stances surrounding the problem. 

Example 3.21 An electronics company receives shipments of a 
Specla type of dry-cell battery. Shipments include about SOO bat- 
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teries and for each shipment the decision problem is whether to ac¬ 
cept or to return the batteries to the supplier. What is the appropriate 
decision-parameter by which to measure quality of the lot—that is, 
by which to describe states of the world? This depends on the cir 

cumstances surrounding the problem. 

Suppose the batteries are installed m a radio or some other piece o 
equipment which is sold, through retailers, to consumers. Then the 
consequences of a decision may depend on the proportion 'of bat¬ 
teries that would burn out or otherwise fail m less than 10 hours o 
use (or whatever other figure management may have found gives rise 
to consumer complaints). Hence, would be, m such a situation, the 

appropriate decision-parameter, . . 

Alternatively, suppose the company uses the batteries internally m 
its own manufacturing or research operations. If so, it may be m er- 
ested in how much use the shipment of batteries as a whole might 
afford. Thus, to think in a per battery figure it may be interested in 
the arithmetic mean of the times to failure of the batteries m a ship¬ 
ment. In that case, ^ would be the decision-parameter of interest 

In either case, however, the decision-parameter could not bedeter¬ 
mined by studying the population—if any information was thought 
desirable, sampling would be necessary. Why. 

Suppose a decision-parameter is selected for a problem, yet m 
analyzing their choice, those in charge of the planning of a study 
find that even if known, the measure would not afford a basis lor de¬ 
cision. What is wrong? The study simply has not been properly 
planned—further planning is still necessary. Perhaps their choice is 
a poor one. Perhaps other decision-parameters are needed to supple¬ 
ment the original choice. Perhaps, not a decision-parameter but a 
frequency distribution must be known. The difficulty may be even 
more basic. Perhaps the wrong population of study was defined, or 
perhaps the population is inadequate. The situation can only be 
remedied by the combined efforts of the subject matter experts and 

the statistician involved in the problem. 

If anything of this sort is wrong, however, the only time to 
remedy it satisfactorily is in the planning stage. It is too late to do 
so once a study has come up with incomplete or inadequate answers. 

3.9 You Should Now Know That 

A second step in statistically defining a problem is specifying 
the decision-parameter(s). 

A decision-parameter is a property of a population which is perti¬ 
nent to a decision problem. 

In many decision problems a decision-parameter presents a way 
of statistically describing states of the world. 
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In some problems two or more decision-parameters are required ; 
in others the entire frequency distribution may be needed. 

The choice of the decision-parameter(s) in any problem is gov¬ 
erned by the answer to the question: “What do we have to know 
about the population in order to make a decision?” 

Selecting the decision-parameter(s) in a problem requires the 
combined judgments of management and the statistician. 

Every property of a population is called a parameter; only if 
pertinent to the decision problem is a property called a decision- 
parameter. 

The proportion, the aggregate, and the arithmetic mean are 
three important and very commonly used decision-parameters. 

The aggregate is the sum or the total of a set of observations. 

The arithmetic mean is a statistical name for what often is called 
“the average.” 

Two other simple and occasionally used decision-parameters are 
the median and the mode. 

The range, the average deviation, and the standard deviation are 
three measures of the variability in a population. 

The standard deviation is the most widely used of these popu¬ 
lation properties. 

The square of the standard deviation is called the variance. 

Among the symbolic names we have adopted for decision-pa¬ 
rameters are: 


t: the proportion 
A : the aggregate 
ju: the arithmetic mean 
a: the standard deviation 
cr 2 : the variance 

The normal distribution or normal curve is a particularly impor¬ 
tant population model. 

3.10 You Should Now Be Able to Solve 

I. Almost 40 years ago, one outstanding statistician in the field of 
educational statistics characterized democracy as a frequency dis¬ 
tribution of abilities and inclinations. As he stated: 

The problem of a democracy is to utilize the talents, attitudes, and 
training of the individuals of a distribution for the sake of raising the 
mean of the self-same distribution to higher levels of capacity, attitude, 
and training. None will deny that the elevation of the social mean is a 
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good, while there are cogent social arguments against decreasing the var¬ 
iability of the group. 9 

Explain why most people in a democracy would agree with this 
statement. 

2. A firm may classify a product it produces as defective or non¬ 
defective and use the proportion defective as a decision-parameter 
to describe the efficiency of its operation. On the other hand, it 
might measure some characteristic of the product, such as its width, 
and use the arithmetic mean as a decision-parameter. What consid¬ 
erations would enter into its choice? 

3. A fraternity brother offers to bet you, with handsome odds, that 
you cannot walk across Mindowaskin Park’s lake in Westfield, New 
Jersey. Consider the population of depths at all parts of the lake. 
What property of this population is pertinent to your decision prob¬ 
lem on whether or not to accept the bet? 

4. Two instructors give the same statistics course at your school. 
One gives a final grade on the basis of the mean of a student’s popu¬ 
lation of quiz grades; the other grades on the basis of the median of 
a student’s population of quiz grades. If you had a choice of with 
whom to take the course, which teacher would you select? Why? 
Assume teaching ability, jokes, difficulty of quizzes, and all other 
pertinent factors constant between the two. 

5. Recent articles on education in the United States have used the 
arithmetic mean salary of teachers as a criterion to demonstrate 
that teachers are paid low salaries. Discuss the adequacy of the 
mean as a decision-parameter in this instance. Would you suggest 
an alternative criterion? 

6. What decision-parameter governs the outcome of a presiden¬ 
tial or congressional (or most any) election? 

7. In each of the following decision problems state the statistical 
population of interest and the appropriate decision-parameter or 
decision-parameters. Explain your answers: 

a) A large women’s wear chain receives a shipment of 2,000 dresses. 
Before distributing the dresses to its various outlets it must deter¬ 
mine whether the manufacturer has marked the proper sizes on the 
dresses. 

b) An automatic machine is set to stamp out bottle caps 1.2 inches in 
diameter. You are to decide whether the machine is operating prop- 

9 Truman L. Kelly “Elementary Statistics in High-School Mathematics as a 
Socializing Agency/’ School and Society, Vol. XI (1920), pp. 228-30. 
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erly or not; that is, whether the setting should be left alone or be ad¬ 
justed. 

c) You want to determine the value of the white goods (sheets towels 
etc.) inventory in a large department store. ' ’ 

8. You apply for a position with a large corporation. The person¬ 
nel director tells you the entrance salary and the minimum and 
maximum salaries employees have earned after 5 years with the 
company. Would these decision-parameters offer sufficient infor¬ 
mation on salaries for you to make a decision on whether or not to 
accept the position if it were offered to you? If not, what other de¬ 
cision-parameters would you want to know? 

9. Your company considers purchasing a special type of thermo¬ 
stat from a manufacturer. That firm’s sales engineer, in discussing 
the quality of the product with you, points out that their produc¬ 
tion process is carefully set to produce a thermostat which operates 
at an average temperature of 64 degrees, and that the standard de¬ 
viation of this population is only .2 degrees. “Therefore,” he says, 
“about 68 per cent of the thermostats we would deliver to you will 
operate at between 63.8 and 64.2 degrees, and almost all parts will 
be within your specifications of 63.4 to 64.6 degrees.” What assump¬ 
tions does this statement rest upon? Explain each carefully. 

3.11 You Will Also Find That 

Most of the statistical measures which we have called decision- 
parameters and which we have studied in this chapter are discussed 
in every basic statistics textbook in one framework or another. Al¬ 
though there is some variation in the amount of detail with which 
books treat averages, variability, and the normal curve, you should 
find the following of interest: 

Hirsch, Werner Z. Introduction to Modern Statistics, pp 32-47 49-64 
New York: Macmillan Co., 1957. 

Mandel, B. J. Statistics for Management, pp. 95-117, 120-41. Baltimore 
Maryland: Dangary Publishing Co., 1956. ’ 

Neter, John, and Wasserman, William. Fundamental Statistics for 
Business and Economics, pp. 187-93. New York: Allyn & Bacon Inc 
1956. } ’’ 

Wallis, W. Allen, and Roberts, Harry V. Statistics: A New Approach 
pp. 211-41, 244-53. Glencoe, Ill.: The Free Press, 1956. 

For additional notes on the early history of the arithmetic mean, 
the standard deviation, and the normal curve, see: 

Walker, Helen. Studies in the History of Statistical Method Balti¬ 
more, Maryland: Williams & Wilkins Co., 1929. 
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Sample Selection 


4.1 Introduction 

The first three chapters have laid the groundwork for our study of 
statistical decisions. They have presented the steps necessary to de¬ 
fine many problems statistically. They have considered how one 
goes about answering such questions as: “What is the problem?” 
“What observations are pertinent to my problem?” “What proper¬ 
ties of my population of observations would I like to know?” In 
short, the material we have considered emphasizes how to think 
about one's problem in statistical terms. Usually, this way of think¬ 
ing simply sets the stage for the making of decisions; it tells you 
whether or not there are pertinent observations and how you might 
use them. 

Statistical decisions, of course, are made on the basis of observa¬ 
tions, but ordinarily not on the basis of a complete set of them 
many are based on only a sample. Hence, once a problem is defined in 
statistical terms there still remain questions on how many observa¬ 
tions to obtain and on how to obtain them. Let's begin with some 
general remarks on the first of these points. How many? 

First of all, you must remember that there are three broad answers 
to this question: (1) no observations, (2) some observations (that 
is, a sample), and (3) all pertinent observations (that is, the popu¬ 
lation completely). 

You see, the statistical definition of a problem may put the deci¬ 
sion problem in such a perspective that an answer is obvious. Then, 
no observations would be necessary in order to see which alternative 
is best. Suppose that we further develop this thought with a very 
specific example. 
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Example 4,1 A certain part is made in the machine department 
of an aircraft parts manufacturer. The parts are then shipped to the 
assembly department (in a nearby plant) in lots of 2,000. Thus, the 
decision problem is whether or not a given lot should be inspected 
before the assembling operation. 

Furthermore, the problem can be defined statistically. In brief, a 
part is an elementary unit; a pertinent observation is whether a part 
is defective or nondefective. The total set of these observations is 
the population. The decision-parameter of interest is tt —the propor¬ 
tion of defectives in a lot. 

Now, let’s consider consequences. We may assume that if a lot is 
inspected prior to its shipment, all defectives can be eliminated (al¬ 
though this is a naive assumption). Inspection costs, however, aver¬ 
age about $50 a lot. 

If a lot is not inspected, any defective parts should be picked up by 
a later 100 per cent check after the assembly operation. The co$t of 
this operation is $.10 per item. Therefore, to assemble a defective 
part is to waste 10 cents. 

For any lot of 2,000 we can denote the proportion of defectives by 
7r —this is the decision-parameter of interest. Then the number of de¬ 
fectives is 2,000 times ? r. Thus, the costs, per lot, associated with the 
alternatives “no inspection” and “100 per cent inspection” are as fol¬ 
lows : 


Type of Cost 

Alternative 

No Inspection 

100% Inspection 

Inspection cost. 

Cost of assembling 

0 

$50 

defective parts. 

$.10X 2,000 X tt 

0 

Total Cost. 

$200 X tt 

$50 


For example, for various values of tt (various states of the world) 
the total costs are: J 


Alternative 



—*-*. i . 1 . 

State of the World (tt) 


_ii 

.00 

.01 

.05 

.10 

.20 

.25 

.30 

.35 

.40 

No inspection.. 

$ 0 

$ 2 

$10 

$20 

$40 

$50 

$60 

$70 j 

$80 

100% inspection. 

50 

50 

50 

50 

50 

50 

50 

50 

50 


What do we conclude? Well, for all values of tt less than .25, no in¬ 
spection is the preferable alternative. Only if a lot is more than 25 
per cent defective should 100 per cent inspection be considered. Of 
course, on any given lot of product the company is uncertain as to the 
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actual value of tt. But the company should be almost certain that tt 
is less than .25. If they are not, something is wrong with the produc¬ 
tion process, and revamping this process, rather than inspection, is 
the solution to the problem. 

In short, the company should not consider the alternative of 100 
per cent inspection at all. Therefore, there is no need to make any ob¬ 
servations on a given lot of parts in order to choose between no in¬ 
spection and 100 per cent inspection. 

The choice on whether or not to make observations in this last il¬ 
lustration rests with an analysis of explicit cost considerations. Such 
a direct analysis is not always possible—but the example still illus¬ 
trates the type of thinking necessary for deciding whether or not to 
make observations. In short, as a general rule, one need not make 
observations in a statistical decision problem unless he actually is 
uncertain about which course of action is the wisest. For example, 
there was no such uncertainty in Example 4.1. This was pointed up 
by the statistical definition of the problem. 

However, in many, if not most, problems some degree of uncer¬ 
tainty exists about the true state of the world. Ordinarily, you will 
find that some observations will be of aid in reducing uncertainty. 
However, this may very well depend on whether you have to conduct 
a study yourself or whether you may be able to find information per¬ 
tinent to your problem already collected. Let's consider this point in 
detail in the next section. 

4.2 Primary and Secondary Data 

There are two different ways in which pertinent observations may 
be obtained for a statistical decision problem. First of all, data may 
be derived from a study conducted for the specific purpose of pro¬ 
viding a basis for decision on the particular problem at hand. Infor¬ 
mation such as this may be referred to as primary data, and the 
statistical study designed to provide them may be referred to as a 
primary study. 

At times, however, the information needed in a decision problem 
may be obtained from data already published or otherwise in exist¬ 
ence. Such data may have been gathered for some other purpose 
than coping with the problem of immediate concern. Nevertheless, 
they may be useful in one’s current problem. Data such as these may 
be called secondary data, and the study by which they were obtained 

may be called a secondary study. 

Generally speaking, all data, in a sense, are at one time or another 
primary data. Someone had to collect them, and they had to be col- 
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lected for some purpose. The distinction of primary versus secondary 
t erefore depends on the particular decision problem at hand, and 
on whether or not the data were collected for that problem. 

For example, decisions on whether to accept or to reject a lot of 
product ordinarily are made by the firm receiving the product on the 
basis of a sample. Such a study, therefore, is primary—at least from 
the point of view of the “consumer.” However, suppose a certain lot 
is returned to the “producer” together with a report of the sample 
results and the producer uses those results in deciding whether to 
scrap, rework, or reship the lot. From his point of view and for his 
decision problem, the data provided by the study are secondary. 

In any statistical study we must consider the potential usefulness 
of both primary data and secondary data as a basis for action. How¬ 
ever, m the case of primary data one main question is: “How can 
we conduct a study to obtain pertinent observations?” In the case 
of secondary data the corresponding question of interest is- “Were 
these data obtained by a reliable procedure?” One always should 
conduct a primary study in accord with certain basic principles of 
s atistics—principles you will learn in the course of our studies. Sim- 
1 arly, one always should evaluate a secondary study by these same 
basic principles. Data collected for another purpose may or may not 
be useful for a given decision problem. They may or may not be re- 
hable. They may or may not come up to your standards. We must 
learn how to judge whether or not they do. 

One obvious advantage of using secondary data, if they are reli¬ 
able, rather than primary data is that ordinarily the former are less 
expensive to obtain. In fact, many business organizations often must 
rely on secondary data because a primary study may be far too ex¬ 
pensive. Remember, we always must weigh the cost of securing nu¬ 
merical information against the benefits (the reduction in uncer¬ 
tainty) to be derived from the use of such data. 

Some secondary data are internal; that is, part of the records of 
the firm that arise in carrying out its day-to-day operations of pur- 
c asmg, producing, selling, shipping, advertising, and so forth These 
data may be compiled primarily to set up the accounting records of 
the firm, yet they also may be drawn upon when needed to provide 
numerical data for decision problems that face the business 
. blnce ■ internal records may serve as secondary data for decision it 
is essential m planning what records to keep and in what form’to 
keep them that management be aware of the types of problems 
these data could help to solve. While it is true that many records 
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must be kept primarily for legal reasons, a little thought about their 
potential decision-making uses may avoid the need for many special 
studies. It also may avoid continuous revisions that otherwise would 
be needed to devise accounting systems to include pertinent numer- 
ical data. 


Example 4.2 Losses due to rework and scrap generally accom¬ 
pany each and every operation in the production of a product. Obvi- 
ously, such losses bring about a reduction in profits, and it is impor¬ 
tant to control them. . ... 

One aircraft company has developed an interesting program m 1 s 
attempts to reduce rework and scrap and thereby cut losses. For one 
thing, attention is given to those areas where losses are greatest, jo 
determine these areas the company uses sample data to estimate the 
ratio of rework and scrap losses to the cost of production labor, buch 
data are available as part of the accounting records of the firm. 

To determine the numerator of this ratio (rework and scrap losses) 
the company’s records must provide information as a basis for the 
allocation of the costs of rework time and scrap to each operation m 
the production of airplanes. To determine the denominator of the 
ratio (production labor), records must be kept of the hours (or cost) 
of direct labor for the same operations. Without such internal data 
the program to control losses would not be possible. Such records ex¬ 
ist at the company because of close co-operation between the ac¬ 
counting and statistical divisions (another instance of the impor¬ 
tance of the principle of teamwork). 


To make decisions in our highly interdependent economy, man¬ 
agement often requires data external to its own enterprises. Thus, it 
may need information on the state of the national economy, on the 
entire industry, and other macroeconomic data. In recent decades a 
vast quantity of these data has been gathered. In this connection, 
the federal government is the largest producer of secondary data m 
the world. And others, including trade associations, private research 
organizations, financial services and journals, the United Nations, 
and state and local governments, contribute pertinent facts m many 
reports, periodicals, yearbooks, and other sources. 

External secondary data usually are provided by some central 
agency when the potential users of such data are not capable ot 
doing the study themselves. Thus, the federal government almost 
continuously provides a variety of information about the economy. 
This is, in many cases, in line with its responsibility of insuring eco¬ 
nomic stability in the country. But businessmen also make use of 
these published data on prices or inventories or sales. In part, the 
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government does the job because it is needed by individuals who 
could not do it themselves. 

An organization that conducts such studies must realize that its 
job is to put the consumer of these data in a position to act as though 
the consumer himself has made the study. In cognizance of this, the 
federal data-colleetion agencies often consult with local business 
groups and local government to determine the type of data, and the 
form in which these data should be, in order that they be useful as 
bases for action. Similarly, trade associations often set up special 
committees to study the type of data needed by their members. 

It is essential that management be aware of the types of external 
secondary data that are available. The use of such data avoids the 
making of decisions in the face of an excessive amount of uncer¬ 
tainty. In addition, it saves the firm the necessary expense of obtain¬ 
ing information already in existence through a study of its own. 

Example 4.3 Many residential areas in Nassau County, New 
York, are serviced only by electricity; no gas service is available. 
Nevertheless, residents in these areas regularly are swamped by mail 
advertisements, telephone calls, and door-to-door promotional cam¬ 
paigns to purchase gas refrigerators, dryers, and heating equipment. 

This needless expenditure of time and money could be avoided by 
a request to the Long Island Lighting Company, the supplier of gas 
and electricity to the area, for a map of the areas that do have gas 
facilities. Some companies apparently have been unaware that sec¬ 
ondary information on the areas serviced by gas is available. 

It would be futile to attempt to list and describe all available 
sources of secondary data here. The study of data sources is a part of 
many different fields of knowledge. Experts in each substantive field 
such as economics, marketing, management, and accountancy are 
acquainted with the sources of data relevant to their own particular 
field. However, some illustrations should point up the usefulness of 
secondary data in statistical decision problems. 

Example 4.4 The Disston Company is a manufacturer of indus¬ 
trial products such as metalworking items, machine knives, and 
woodworking items. The company determines the sales potential of 
its products in various geographical areas. It then uses this informa¬ 
tion as a guide to evaluate the effectiveness of a distributor in selling 
the company’s product in his area, and acts accordingly. 

To determine sales potential, the company relies on secondary 
data. Members of the research department have stated: “As a multi- 
line manufacturer of industrial goods . . . w T e cannot afford to spend 


94 


STATISTICS FOR BUSINESS DECISIONS 


[Oh. 4 


the time in thorough and exhaustive market analysis for each of our 
products individually; and as a result, the existence of detailed in¬ 
formation such as the Census of Manufactures is of enormous bene¬ 
fit to us.” 1 

The Census of Manufactures is a survey of all manufacturing ac¬ 
tivities in the United States. It is conducted under the auspices of the 
U.S. Bureau of the Census. Published results include extensive detail 
on employment, payrolls, production, and consumption of raw ma¬ 
terials for many industries. 

Thus, the Disston Company determines its area potential for ve¬ 
neer knives by consulting the Census of Manufactures to find out the 
quantity of veneer produced by industries who could use their knives: 
veneer plywood, cigar boxes, fruit and vegetable baskets, and wooden 
boxes. The demand for knives to cut veneer is then related to the 
quantity of veneer produced. External secondary data provide a ba¬ 
sis for decision. 

Example 4.5 According to one authority, 2 the criteria important 
to arbitrators in making their decisions about wage awards include 
the following: (1) changes in the cost of living and real wages, 
(2) budget studies, (3) wage comparisons, (4) changes in produc¬ 
tivity, (5) ability to pay, and (6) the economic environment. Al¬ 
though decisions do not rest solely on numerical information, such 
data, to be useful, must be timely and reliable. 

At times, primary studies may be conducted to provide informa¬ 
tion for arbitrators on some of these criteria. Often, however, reliable 
secondary data are available on many of these points. 

Eor example, wage data are compiled by the Bureau of Labor Sta¬ 
tistics and published in the Monthly Labor Review and in special 
publications. The same organization publishes comprehensive in¬ 
dexes showing changes in consumer prices. The National Industrial 
Conference Board, a private research organization, also compiles 
wage data and publishes a consumer price index. In addition, many 
trade publications (such as Editor and Publisher ), trade associa¬ 
tions, and state and local governments publish wage data. 

Data on worker’s budgets are prepared by the U.S. Department 
of Labor and are available for 134 cities. One study was made in 
1947, but the budget can be brought up to date by applying changes 
in the consumers’ price index and in taxes. 

Information on productivity is compiled by the Bureau of Labor 
Statistics of the U. S. Department of Labor for a number of impor¬ 
tant industries. 

Data on industry and total corporate profits (as one measure of 

1 Henry Schweitzer and Philip Torray, “Uses and Development of Industrial 
Potentials” in Harry Brenner (ed.). Marketing Research Pays Off (Pleasantville, 
N.Y.: Printers’ Ink Books, 1955), p. 305. 

2 Jules Backman, Economic Data Utilized in Wage Arbitration, Labor Relations 
Series (Philadelphia: University of Pennsylvania Press, 1952). 
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ability to pay) are compiled by: (1) the U.S. Department of Com¬ 
merce and reported in the Survey of Current Business; (2) the Fed¬ 
eral Reserve Board and reported in the Federal Reserve Bulletin- 
(3) the Bureau of Internal Revenue and reported in Statistics of In¬ 
come; (4) the Federal Trade Commission and Securities and Ex¬ 
change Commission and published in quarterly reports; (5) the Na¬ 
tional City Bank and reported in its Monthly Letter. Data on indi¬ 
vidual companies can be obtained from Moody’s and Standard and 
Poor s financial reports and from newspaper reports. 

. Current information on the general status of the economy is found 
in such sources as the Survey of Current Business and the Federal 
Reserve Bulletin . Historical series covering many phases of economic 
life may be found m the Statistical Abstract of the United States a 
publication of the Department of Commerce. 

Many other sources of secondary data also could be listed in con¬ 
nection with this decision problem. Knowledge of the availability of 
such information and the proper manner of relating observations to 
decisions in wage arbitration are an integral part of that branch of 
applied economics. 


Example 4.6 The Federal Trade Commission once issued a com¬ 
plaint against a flour company to the effect that a planned acqui¬ 
sition by the company would substantially lessen competition. 

In order to make a proper economic evaluation of the complaint it 
was necessary to determine what proportion of the total market for 
flour was controlled by the companies to be merged. Flour is of three 
types: family, bakery, and mix. 

To determine the necessary information the company used the fol¬ 
lowing sources: 

1. The total market for all types of flour was obtained from the 
Department of Agriculture’s yearly estimates of total civilian 
wheat flour consumption. 

2. Family flour consumption data were provided by estimates from 
the U.S. Department of Commerce. 

3. Bakery flour consumption was provided by the Census of Man- 

ufactures . 

4. Trends in brand share of the family and mix flour markets 
were obtained from studies conducted by the Market Research 
Corporation of America. 

5. Family and mix flour consumer habit patterns for rural and ur¬ 
ban areas were obtained from studies conducted for other pur¬ 
poses by the company. 

6. Company deliveries of family and mix flour were derived from 
the company’s internal records. 

7. Population estimates were obtained from the Bureau of the 
Census of the U.S. Department of Commerce. 


This il lustration is based on G. R. Detlefsen, 
mating/’ in H. Brenner (ed.), op. cit., pp. 57-67. 


"A Procedure for Market Esti- 
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8. National income data were obtained from the U.S. Department 
of Commerce. 

Thus, all the basic data were secondary in nature. In addition, ail 
the data with the exception of those concerning the company’s de¬ 
liveries and the figures from the Census oj Manufactures were derived 
from sample studies, and had to be evaluated as such. 

The principle of planning must be remembered in secondary 
studies as well as in primary studies. It is too often forgotten because 
secondary data usually are less expensive to obtain than primary 
data. Despite this fact, planning—relating decisions to potential 
observations—still is necessary. For one thing, the cost of gathering 
even secondary data is far from trivial. Furthermore, masses of data 
may be built up—data which confuse rather than clarify the prob¬ 
lems facing a decision maker unless planning is implemented. 

When we use secondary studies, care must be taken to verify that 
the data refer to the population of interest. If not, it must be de¬ 
cided whether or not the study provides an adequate working popu¬ 
lation. Other questions also must be raised: “Are the data sufficient to 
allow us to determine the decision-parameter(s) of interest? ^Can 
we have confidence in the data found in secondary studies? 

The auspices under which a study is conducted may be a general 
indicator of the reliability of secondary data. However, more precise 
criteria are needed to evaluate the quality of such data. Thus, as 
noted, whether a study is primary or secondary, we should conduct 
or evaluate it by certain basic statistical principles—those we are m 
the process of learning. 

4.3 Reasons for Sampling 

Whether data are primary or secondary, it is important to recog- 
nize that they may be either complete or sample information. In this 
book our emphasis is upon sampling since statistics is the subject 
which deals with making decisions in the absence of complete or 
population data. Thus, when the decision-parameter and, hence, 
the true state of the world in a business problem is unknown, certain 
statistical methods and theory may be used to provide a basis for 
decision. These techniques are based on the theory of sampling. 
Sampling is important in primary studies for reasons considered be¬ 
low. It also is important when we are dealing with secondary data; if 
such data are based on a sample study, they must be evaluated care- 
fully. 

When a statistician talks about a sample, he means a part of a sta- 



Ch. 4] 


SAMPLE SELECTION 


97 


tistical population. We have already discussed the latter concept in 
considerable detail; you should recall that a statistical population 
is a complete set of pertinent observations, either in the form of 
variates or attributes. 

A sample , therefore, simply is some of the pertinent observations 
from a specified population. 

Furthermore, sampling is the process by which the sample of ob¬ 
servations is obtained from a population. Thus, sampling involves 
at least two steps: selecting elementary units and making observa¬ 
tions on them. 

Remember, then, that strictly speaking, a sample refers to a 
group of observations. Thus, when a statistician “samples/' he 
“samples'" observations. However, at times, it is convenient and 
customary also to speak of a sample of elementary units because 
such units must be selected before we observe them. Therefore, as 
is often done, we may use the terms “sample” and “sampling” in 
two ways: to refer to a well-selected set of observations and to refer 
to a well-selected set of elementary units. 

In either event, however, sample information represents incom¬ 
plete information. Samples are parts of statistical populations which 
represent sets of complete information. But why and when should 
we ever utilize samples? Why settle for less than the whole popula¬ 
tion? There are a variety of reasons which arise individually or col¬ 
lectively in a large number of business problems. Probably the five 
most basic and most widely occurring reasons can be termed and 
listed as follows: 

a) Economy 

b) Timeliness 

c) Destructive nature of a test 

d) Accuracy 

e) Infinite population 

Before elaborating and illustrating these notions it should be 
stressed that not all of these reasons are equally important in every 
business problem. However, often at least one of them must be con¬ 
sidered. Also remember that other good reasons for sampling be¬ 
sides these may occur in certain problems. Now for explanations and 
examples of the five terms given above. 

Economy refers to the simple fact that obtaining sample infor¬ 
mation is often (but not always) less expensive than obtaining com¬ 
plete information. 
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Example 4.7 A soap company wishes to determine the distribu¬ 
tion of lengths of time a standard box of detergent lasts a housewife. 
The purpose of the study is to decide whether or not to market vari¬ 
ous sizes of boxes of the detergent, A survey which would include all 
housewives in the United States obviously would be far too expen¬ 
sive. However, a sample of housewives would provide a cheaper, yet 
reliable, basis for making the decision. 

Timeliness refers to the idea that obtaining sample information 
generally will result in a considerable saving in time spent on a sta¬ 
tistical study and will allow prompt and timely results. In many busi¬ 
ness problems this time element is a consideration of paramount im¬ 
portance. 

Example 4.8 As noted in an earlier section, the federal govern¬ 
ment and many private business organizations periodically publish 
numerical information on the United States economy. These statisti¬ 
cal data include employment, price, inventory, production, and pur¬ 
chasing figures for selected industries and for the whole country. Un¬ 
less the information can be brought out rapidly, the time period to 
which the data would refer makes them of little use as an aid in de¬ 
termining the level of the current economic scene or as a guide in 
short-run economic forecasting. Sampling, however, does allow for 
the timely collection of these data. 

In order to make observations in some business problems, par¬ 
ticularly those dealing with manufactured products, the elementary 
unit being observed must be destroyed or weakened. To test all units 
of the population would result in their destruction. It is obvious that 
sampling is a better alternative. 

Example 4.9 A firm produces radio tubes which it would like to 
sell with a guarantee of 100 hours. How can it estimate the number 
of tubes that would fail in less time and be returned? Testing all of 
the radio tubes until they failed is out of the question for there would 
be none left to sell. However, selecting and testing some of the tubes 
provides one basis for making a decision. 

The fourth point, accuracy , is a subtle one. It refers to the fact that 
very often sample information is more accurate than complete in¬ 
formation and, therefore, permits more valid decisions. This holds 
true whenever a sample study can be conducted more effectively 
than a complete count. 

For example, a statistical survey which involves personally con¬ 
tacting individuals to get information—perhaps market research in¬ 
formation—necessitates a group of competent and well-trained in¬ 
terviewers. A complete count ordinarily necessitates a large group 
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of interviewers; a sample necessitates only a small group This 
smaller group is usually easier to train; the larger may suffer from 
the effects of the law of diminishing returns. Thus, a sample survey 
with a small, compact, well-trained set of interviewers usually will 
be able to get more valid information from the public. The result: 
more accurate results and more valid decisions. 

The last argument for sampling, an infinite population, should be 
a fairly obvious point. Generally, it refers to the fact that when we 
are dealing with an infinite population, as in an analytical study 
we must settle for a sample from the population. 

Example 4.10 A manufacturing process produces coils. Suppose 
the production manager of the company must decide on a setting for 
a machine. Since his decision is about the process, the population of 
study should be defined as the totality of observations (e g defec¬ 
tive versus nondefective) on all coils that the process could turn out 
under a given set of operating conditions. Thus, note that any statis¬ 
tical study of the process would be analytical and that the popula¬ 
tion is infinite. p 

It follows that a complete count is impossible. Observations on 
any one or two or three day’s product would represent only partial 
information about the population. If any information is to be the 
basis for decision, it must be sample information. 

In concluding our remarks on reasons for sampling it should be 
remembered that in some problems a statistician will not recom¬ 
mend the use of sample data. Rather he might say: “Make your de¬ 
cision without the benefit of observations (as in Example 4.1)_it 

is not feasible to attempt to get any information." 

. 0r > m the other extreme type of problem, he might say, “Let's uti¬ 
lize a complete count and not a sample for this problem—all things 
considered, it will be more economical." Why should he say such a 
thing? First of all, the use of samples in decision making always in¬ 
volves some risks—risks of making an incorrect decision because 
complete information was not available. Thus, a complete count is 
preferable if no risks can be tolerated. 

Example 4.11 As you undoubtedly know, the number of Con¬ 
gressmen from each state in the House of Representatives is based on 
the population of the states. This is required by the Constitution 
(Article I, bection 2), and provision is there made for a decenniel 
census of the population in order to so apportion representatives in 
Congress among the states. Thus, the Constitution provides that an 
enumeration of the population be made within 3 years after the 
first meeting of the Congress of the United States and within everv 
subsequent term of 10 years. 
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Politically, it would not be feasible to base this apportionment de¬ 
cision on sample data—no risk of an incorrect decision due to incom¬ 
plete information would be tolerated. 

Secondly, a complete count is preferable to a sample if a popula¬ 
tion is small This is so because the difference between the cost of a 
sample and the cost of a complete count in such a situation is also 

small 

Notwithstanding the fact that some decision problems can be 
solved either without the benefit of observations or with complete 
information, the large majority are best solved through the use of 
sample data. This is true when we use primary data, and it is equally 
true when we use secondary data. The next main questions in our 
studies, then, are: “How should sample data be obtained?” and 
“What is a well-selected sample?” 

4.4 All Possible Samples 

We now should be clear on the idea that sample information is 
partial information and that this partial information is obtained 
from a group of only some (not all) elementary units. However, it 
is important to recognize that in any given problem there are many 
different samples of elementary units that might be selected for 
study. And it is equally important to recognize that in most prob¬ 
lems only one sample actually is selected. 

Before discussing procedures for selecting that one sample, how¬ 
ever, let’s consider all possible groups of elementary units. Suppose, 
for example, that you are interested in testing a group of 6 machines 
—call them A, B, C, D, E, and F. How many different samples of 
2 are possible? One way of answering this question is through an 
enumeration of all possibilities. Thus, we find the following combi¬ 
nations : 

AB BC CE 

AC BD CF 

AD BE DE 

AE BF DF 

AF CD EF 

These 15 combinations exhaust the set of all possible samples of 2 
machines. However, in any test of a sample of 2 machines, one and 
only one of these combinations actually would be selected. 

In general, suppose that we have a set of N elementary units rep¬ 
resenting a population. How many possible sample of n units are 
there? For example, we know that the answer to this question for 
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N 6 and n — 2 is 15, and we might be able to answer it for other 
specific values of N and n on the basis of similar enumerations. How¬ 
ever, the general answer can be provided only by a branch of mathe¬ 
matics called combinatory analysis. 

Thus, to express this answer in terms of a formula we must intro¬ 
duce a type of number called a factorial. For example, “5 factorial” 
is written 5! and is equal to5X4X3X2Xl= 120. Other ex¬ 
amples of factorials are: 

1! - 1 

2! = 2 X 1 = 2 
31 = 3X2X1 = 6 
41 = 4X3X2X1 =24 
10! - 10 X 9 X ... X 2 X 1 = 3,628,800 . 

As you can see, factorials increase in value very rapidly. If you need 
further proof consider the facts that 

20! - 20 X 19 X ... X 2 X 1 = 2,432,902,008,176,640,000 

and 100! is almost equal to the number 1 with 158 zeros after it! 

We need not consider the values of factorials for anything other 
than positive integers and zero. Zero factorial, incidentally, is writ¬ 
ten 0! and is, by convention (and for convenience) taken equal 
to 1. (Thus, 0! = 1!, since both equal 1.) 

The number of possible samples of n elementary units from a 
population of N elementary units may be expressed in terms of fac¬ 
torials. It equals 


Nl 

n\ (N — n)\ ' 

For example, with N — 6 and n = 2, we have 

5! ^ 6X5X4 X3X2 XI _ 720 
2! 4! 2 X 1 X 4 X 3 X 2 X 1 ~~ 48~ “ 15 * 

Similarly, as you may verify: 


N 

n 

Number of 
Possible Samples 

6 

3 

20 

10 

4 

210 

20 

6 

38,760 

100 

3 i 

161,700 


That is, there are 20 possible samples of 3 from a population of 6, etc. 
One last detail—-suppose, as in an analytical study, a population 
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is infinite; that is, N is infinite. Then, there are an infinite number 
of possible samples of a given size. 

Now, with the idea of all possible samples in mind, in the next sec¬ 
tion let’s return to the problem of how we actually select one of these 
possibilities in a practical problem. 

4.5 Probability, Judgment, and Convenience Sampling 

The key question to be answered in this section is: “What proce¬ 
dure should be used to select one of the many possible samples of ele¬ 
mentary units that are available in a given study?” 

There are many different ways of selecting a sample. It is con¬ 
venient, however, to divide these many ways into three categories, 
probability sampling, judgment sampling, and convenience sam¬ 
pling. See if you can attach these suggestive terms to the methods 
of sample selection which are given in the following illustration. 

Example 4.12 An association of purchasing agents, located in 
New York, periodically conducts a sample survey of its members to 
determine their buying policies and their expectations of changes in 
the price level. It reports the results of the study to. all members, 
some of whom use the information as a basis for modifying present 
purchasing policy. 

There are, of course, many ways in which this type of survey might 
be carried out. Among them are included the following: 

a) The officials of the association might select 200 of its members, 
including those whom they considered “typical” or “representative’ 
of the entire membership. This sample of 200 members then would be 
asked to provide the desired information, which would be considered 
“representative” of the population (which involves all members). 

b) Information might be requested only from those members 
who worked in the New York area because they could be contacted 
quickly and most easily. Again this information would be a sample 
from the population of interest. 

c) The officials of the association might write each member’s name 
on a separate slip of paper, place all slips in a hat, mix them thor¬ 
oughly, and select, say, 200 at random. The members so selected 
would be contacted to provide the sample information. 

If you haven’t guessed, method (a) in this example is an illustra* 
tion of judgment sampling (or as it is also called, purposive sam¬ 
pling). This method of sample selection calls for an expert in the 
subject matter—e.g., purchasing or economics to decide on which 
elementary units should be included in a study. Thus, for all judg¬ 
ment samples, the judgment of an individual determines the ele¬ 
mentary units that should be included in a sample. Usually this judg- 
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ment is exercised by attempting to include what the expert con¬ 
siders “typical” or “representative” units. 

Method (b) represents convenience sampling. Expert judgment 
is not used to decide on the elementary units to be included in a con¬ 
venience sample—convenience is the only criterion for selection. 
Any sample derived in this way is, therefore, only a convenient slice 
of a population—this slice sometimes is called a chunk to distinguish 
it from a well-selected sample. Other examples of chunks are rep¬ 
resented by (1) interviews of people sitting in a park because they 
happen to be there, (2) the selection of one’s class of students be¬ 
cause they are handy, and (3) comments by customers who com¬ 
plained about some product during the past month because their 
names are readily available. 

In contrast to the use of judgment, or the criterion of convenience, 
in the selection of a sample is probability sampling (or random sam¬ 
pling). For the moment we may view a probability sample as one 
which is selected by some random process. A crude and imperfect 
example of a probability sample is illustrated by (c) in Example 
4.12. More precise and reliable methods of selecting probability 
samples will be considered later. In any event, probability sampling 
involves giving every elementary unit some known chance of ap¬ 
pearing in a sample. 

Throughout this course we will center our attention on probability 
sampling, for probability sampling is of paramount importance in 
the field of statistics. Statisticians regard probability sampling as the 
best method of sample selection because it is the only method of 
sampling which can be objectively evaluated. (As we will soon see, 
this evaluation is accomplished through probability theory.) Thus, 
their train of thought runs along the following lines. How do we use 
samples? We use them to make decisions when we are uncertain 
about the value of a decision-parameter. Since there are many pos¬ 
sible samples, the one actually selected may be good or bad, and we 
rarely can determine which it is. If we can, there ordinarily is no 
need to sample. Thus, a sample may or may not lead to a correct 
decision—there are always certain risks of making incorrect deci¬ 
sions whenever courses of action are taken on the basis of samples 
rather than on the basis of complete information. Only the use of 
probability sampling permits the measurement of these risks. 

It still must be remembered, however, that judgment sampling 
and convenience sampling have more than a few legitimate uses. For 
one thing, judgment sampling often is preferable to probability sam- 
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pling if only a small sample is to be used as a basis for decision. How 
large is small? This depends on the problem situation. For example, 
if eight or ten counties were to be selected from all counties in the 
United States for a study of the condition of county roads, it prob¬ 
ably would be best to select the samples by using (an expert’s en¬ 
gineering) judgment. Experience has led statisticians to expect a 
smaller risk of error with judgment sampling than with probability 
sampling in such a case. 

Example 4.1S Suppose that a legislative committee conducts an 
investigation of state taxation of interstate commerce. They un¬ 
doubtedly would want to check and to study the state tax laws of all 
states as a basis for deciding on what type of federal legislation, if 
any, to recommend to Congress. However, if a stop-gap law is needed 
quickly, a sample of one or two or three states’ tax laws might be 
selected for immediate study. If so, it would be best to select those 
few states on the basis of a tax expert’s judgment as to which laws 
were representative of all the state statutes. 

However, judgment sampling often is dangerous, and, unfortu¬ 
nately, it is used far more frequently than it should be in far too 
many types of decision problems. 

Example 4.14 One inadequacy of judgment sampling is pointed 
up by the results of a sampling experiment conducted by an English 
statistician, Frank Yates. 4 A group of about 1,200 stones, varying in 
size, was spread out on a table, and 12 persons were each asked to 
select 3 samples of 20 stones, which were to represent, as nearly as 
possible, the size distribution of the whole collection. 

The results of the experiment showed that there was a consistent 
tendency, common to nearly all persons, to select a sample which 
overestimated the average size of the stones. In fact, only 6 of the 36 
estimates were smaller than the true average weight, and 3 of these 
estimates were made by a single observer. 

Another interesting aspect of such a judgment sampling procedure 
is the consistent tendency to underestimate the variability in the dis¬ 
tribution. Thus, most of the persons selected stones as near their con¬ 
cept of the average size as possible, with a much smaller proportion 
of extreme sizes than in the whole population. 

It must be remembered that judgment is not unimportant in sta¬ 
tistics—we already have used it in several ways, and we will use it 
in many others. For example, judgment is quite necessary in defining 
a problem, in defining a statistical population, and in specifying the 
decision-parameter. However, except in rare cases, judgment is un- 

4 See Frank Yates, Sampling Methods for Censuses and Surveys (London: 
Charles Griffin & Co., Ltd., 1949), p. 12. 
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necessary and even undesirable as a basis for deciding which elemen¬ 
tary units should be selected. Hence, except where noted, selection 
by probability sampling is to be preferred to judgment selection. 

Convenience sampling, too, often is misused. Convenience sam¬ 
pling may lead to reliable results, or it may not. As with judgment 
sampling, there is no way of evaluating it. Convenience sampling 
often is particularly dangerous when one assumes the sample so 
selected will be "representative" of the population. 


Example 4.15 One once overly popular method of sample selec¬ 
tion m marketing research is called quota sampling . A quota sample 
is, in everyday language, supposedly, a “perfect cross section” of the 
population. Unfortunately, there are several defects in quota sam¬ 
pling, and many experiences have shown it a complete failure. 

In the design of a quota sample, certain quotas are set—for exam¬ 
ple, for the number of males and females to be included in the sam¬ 
ple, for the number of urban and rural families to be included, for the 
number of young, middle-aged, and older persons to be included, etc. 
Thus, the sample is distributed over various groupings so that it will 
agree almost perfectly with known data in regard to such criteria as 
area, sex, age groups, economic levels, etc. 

^However, it is important to recognize that this will not guarantee 
a “good” sample. In fact, this type of sample may fail completely to 
correspond to the population of interest- with respect to the charac¬ 
teristics that the study is supposed to measure (even though it cor¬ 
responds in several characteristics already known and used to con¬ 
struct quotas). 


Once quotas are set, they then often are given to interviewers to 
fill as best they can by searching out people with the appropriate 
characteristics. To this extent, quota sampling is “convenient.” Thus, 
despite its pretense at careful planning, the actual selection of ele¬ 
mentary units is left to interviewers and their convenience. 


Example 4.16 The experience of the Literary Digest (a once- 
popular but now defunct magazine) in predicting the outcome of the 
1936 presidential election is cited widely in statistical circles as an 
example of the danger of interpreting any sample as reliable. 

The previous political predictions of the Digest poll had been good 
at times. For example, in 1932 it predicted the vote for Franklin D. 
Roosevelt within 1 percentage point. However, in 1936 it predicted the 
defeat oi Roosevelt and the election of his Republican opponent, 
Alfred M. Landon. Actually Roosevelt won, receiving 60 per cent of 
the votes cast. The Digest poll had failed, to pick the winner—what 
was worse the poll made an error of 19 percentage points in predict¬ 
ing Roosevelt’s vote. 

The DigesTs predictions were based on a convenience sample_• 

and on a rather large one at that. From a mailing list of about 10 
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million names, more than 2 million ballots were received and in¬ 
cluded in the poll. The Digest settled for the returns received—they 
were conveniently available. This illustrates the point that even a 
large convenience sample may yield poor results. Increased sample 
size does not cover up the inadequacies of a poorly selected sample. 

Thus, statisticians regard convenience sampling as useful only 
when one needs rough estimates, or when a probability or judgment 
sample is impossible, or when an important decision does not hinge 
on the data to be collected. In addition, convenience sampling is par¬ 
ticularly useful when the purpose of a study is only to produce ideas 
rather than to make decisions. 

Example 411 A baby food products manufacturer continuously 
engages in research to bring the country’s infants better puddings, 
cereals, strained fruits, etc. In this connection, parents are contacted 
and they and their children are requested to try certain foods. Par¬ 
ents are asked to comment on the foods. Of course, not all parents 
(nor children) are willing to try these new products and provide 
comments—hence, those that do represent only a convenience sam- 

P Fortunately, then, important marketing decisions are not made 
on the basis of this type of information. However, since the purpose 
of such a study merely is to develop ideas on color, appearance, fla¬ 
vor, etc., of baby foods, a convenience sample suffices. 


Example 418 Convenience samples. are used extensively m 
motivation research. Such studies as applied to marketing generally 
attempt to determine why people buy. They involve the use of such 
psychological techniques as depth interviews, word association, nar¬ 
rative projection, cartoons, and personality tests. Motivation re¬ 
search stresses the importance of individual and social needs as de¬ 
terminants of buying power in addition to economic and physical 
H66(ls 

Motivation research very often is used for exploratory purposes 
to give ideas to management as to which alternative courses of ac¬ 
tion are available. In such instances convenience samples are very 
effective. However, when the function becomes more than that, of 
exploration of possible alternative courses of action, the selection 
of respondents and sample size should be determined on the basis of 

probability sampling theory. 5 . . , 

Recently a motivation research study to determine why people 
prefer particular makes of automobiles was conducted to consider 
possible contents of automobile advertisements. The study indicated 
the possible conclusion that the purchase of a car hinges more on 


5 Joseph W. Newman, 
ton: Harvard University, 
Research, 1955). 


Motivation Research and Marketing Management ( Bos- 
Graduate School of Business Administration, Division of 
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suiting the personality of the car to the personality of the buyer than 
on the engineering features of the car. The report thus focused atten¬ 
tion on alternative forms of advertising to project the personality 
of automobiles. 

In short, then, probability sampling—selection by means of some 
random process—enables one to measure risks of incorrect decisions 
objectively. Since statisticians consider this of paramount impor¬ 
tance, a good part of our study of statistics will center on it. Judg¬ 
ment and convenience sampling will be dealt with only insofar as we 
need contrast them to probability sampling. However, for this reason 
you should keep them in mind. 

Also remember that no matter whether a sample is selected by 
judgment, by convenience, or by a random process, the principle of 
planning must still be applied. Careful consideration of how the 
sample information will be used is invariably necessary to prevent 
the collection of unnecessary data. 

You may have noticed that so far we have not considered how one 
should decide on the size of a sample. Often the number of elemen¬ 
tary units to be selected for study is governed by the amount of time 
and money management is willing to spend to provide a basis for 
decision. More objective ways of determining sample size also are 
available when we use probability sampling, but these depend on 
theory we have yet to develop and must be discussed later. 

Let’s now turn to a fuller discussion of probability sampling. 

4.6 Simple Random Sampling 

Probability sampling, as was noted in the last section, is a method 
of obtaining partial information. It involves selecting elementary 
units by a random process. When probability sampling is used, we 
can determine the chance that a given elementary unit will be se¬ 
lected for study and this, in turn, will enable us to determine risks 
of incorrect decisions. 

# ^ mus f be noted that there are many different types of proba¬ 
bility samples, just as there are many different ways of exercising 
judgment or applying the criterion of convenience in those sampling 
procedures. Thus, in the remaining pages of this chapter we will con¬ 
sider several different methods of probability sampling and how we 
can apply them in administratively efficient ways. As a start in 
this direction, suppose that we consider the simplest, most basic, 
and most common type of all probability sampling procedures— sim¬ 
ple random sampling . 
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Simple random sampling may be defined as a method of selection 
which gives every possible sample of size n the same chance of being 
the sample chosen in a study. 

For example, suppose we have a population represented by five 
college professors and including their subjects of specialization: eco¬ 
nomics, psychology, sociology, history, and government. All possible 
samples of two include the following (in terms of first letters). 

EP EH PS PG SG 

ES EG PH SH HG 

If we use a procedure of selection which gives each of these 10 com¬ 
binations an equal chance to be the one sample selected, we are using 
a method of sampling called simple random sampling. 

How, in a practical problem, can a simple random sample be se¬ 
lected? Are there any ways of carrying out the procedure we have 
defined? There are—in fact there are a variety of ways. In the next 
section we will turn to a more sophisticated method, but for the 
present consider a procedure similar to one suggested earlier: Iden¬ 
tify every possible sample combination (AB, AC, AD, etc.) on a 
slip of paper, throw all the slips in a hat, and select one of them at 
random. Such a procedure, if carefully applied, should give every 
possible combination of elementary units the same chance of being 
the one sample realized in a given problem. Hence, it should meet 
the definition of simple random sampling given above. . 

However, one refinement in our procedure for selecting a simple 
random sample is desirable even when we sample by drawing slips 
of paper from a hat. Rather than choosing combinations of elemen¬ 
tary units, we may choose units one at a time. Thus, to select a sim¬ 
ple random sample it may be shown that all we need do is use a 
method of selection which gives every elementary unit the same 
chance of being the first unit selected, every remaining elementary 
unit the same chance of being the second unit selected, every re¬ 
maining elementary unit the same chance of being the third unit 
selected, and so on. 

Ordinarily it is easier to select a simple random sample by picking 
elementary units one at a time. For one thing, this allows us to by¬ 
pass the problem of enumerating all possible samples, the number 
of which is usually very large. 

Therefore, it is important to recognize that under this alternative 
procedure, it may be shown that every possible sample has a known 
(and equal) chance of being the one sample actually selected. In 
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the next chapter we will express these known chances in numerical 
terms—in terms of a probability measure. This, in turn, will allow 
us to calculate risks of making incorrect decisions on the basis of 
sample information. 

, tIle present simply note that this idea of “calculable risk” is 
the advan tage of any method of probability sampling. This is impor¬ 
tant because there are several popular misconceptions about prob¬ 
ability sampling. For example, probability sampling does not guar¬ 
antee a “fair,” or a “typical,” or a “representative” sample, though 
some people loosely and incorrectly describe probability sampling 
by these terms. For one thing, such terms are vague and usually 
meaningless. For another, no method of sampling can guarantee fair 
or typical or representative samples. Generally speaking, there is no 
way of evaluating samples. However, with probability sampling we 
can evaluate the sampling procedure in terms of the risks it involves. 
This idea of “calculable risk” is its important advantage. 

The misconception of fairness is particularly prevalent in the case 
ot simple random sampling and stems from the fact that simple ran¬ 
dom sampling is thought of as a procedure which gives every ele¬ 
mentary unit the same chance of being selected. First of all, it must 
e recognized that many other methods of probability sampling also 
give every elementary unit the same chance of appearing in a sam¬ 
ple. For example, with a population of six elementary units, A, B, 

rj’ ]D ’ E ’ and E ’ we mi S ht P^k one of the three combinations, AB 
CD, or El at random. You can see this procedure gives every ele¬ 
mentary unit an equal chance. However, it does not give every com- 
bmation of elementary units the same chance. (Several including 

A u A i?v AE ’ etc ’’ would have no chance.) Thus, the procedure is a 
probability sample of some sort, but it is not a simple random sam- 
pie. 

. Secondly, note that some methods of probability sampling, by de¬ 
sign, may give some elementary units a better chance of being se¬ 
lected than others. For example, in a sample study of profits of 
motion-picture theaters one might wish to give Grauman’s Chinese 
Theater and Radio City Music Hall a larger chance of appearing in 
a sample than Neighborhood Theater or Hometown Cinema. The 
important characteristic of probability sampling is the fact that 
every elementary unit has an ascertainable chance of being selected 
—these chances may be equal or unequal. 

In our discussion of probability sampling, we implicitly have lim¬ 
ited ourselves to finite populations, i.e., enumerative studies. Now, 
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the matter of probability sampling from infinite populations (for 
analytical studies) deserves some attention. 

One of our standard references for analytical studies has been a 
decision problem dealing with a production process. Since in this 
type of problem a manufacturer is interested in taking action on a 
process, the population of study is infinite. It is represented by the 
totality of product the process could produce if it continued to oper¬ 
ate indefinitely under a given set of conditions. Now, we have de¬ 
fined probability sampling as the random selection of part of a 
population so that every elementary unit has a known chance of 
appearing in the sample. What does this mean for an analytical 

study? 

First of all, you must realize that in sampling from such an in- 
finite population we are limited to selecting from one or more days’ 
output. That is, even though we want to select from the total popu¬ 
lation (all possible product the process could produce), we opera¬ 
tionally are limited to a part of this total population (what has been 
produced today or over the last few days). How do we know, then, 
that we have a probability sample from the total population when 
we are not controlling the selection procedure? 

The answer to this question depends on the nature of the produc¬ 
tion process. Experience may have taught us that such a process it¬ 
self operates in a random manner. Thus, we may assume a produc¬ 
tion process generates a probability sample of all product it could 

produce. 

Thus, even if we were to take a complete count of all product pro¬ 
duced on a given day, we would get a probability sample from the 
total population. At least we would if our assumption that the 
process is generating some type of probability sample is a good as¬ 
sumption. Furthermore, if we were to select part of any day s out¬ 
put we also would wind up with a probability sample of the total 
population. Again, however, this result depends upon our assump¬ 
tion that the process is operating randomly. . 

These assumptions may be tested, in some cases, by statistical 
methods on the basis of our experience with the process under study. 
That is, for a given type of probability sample our sample should 
behave in a certain way, and empirically we can see whether or not 
it is behaving in that manner. For example, in flipping a coin, ob¬ 
serving HTHTHTHTHTHT would make us suspicious that 
our flipping process was not generating a simple random sample- 
each observation seems to depend on the previous one. 
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4,7 Random Number Tables 

The only method we have considered for drawing a probability 
sample thus far has involved slips of paper and a hat. However, this 
procedure should not be used in practice because it presents several 
difficulties. First of all, the slips of paper representing the various 
combinations of elementary units or the elementary units them¬ 
selves should be almost exactly the same size and weight. They can¬ 
not be too heterogeneous; otherwise their respective chances of 
being selected would vary significantly. Furthermore, the mixing of 
the slips before one is selected must be more thorough than most 
people can imagine. Unless everything is ideal in every sense of 
the word, we would not be using a procedure which met our defini¬ 
tion of simple random sampling. What’s worse, we might end up 
thinking that we were selecting a simple random sample and our 
thinking would be all wrong. 

To top everything else, writing each possible sample or even writ¬ 
ing elementary units on separate slips of paper in order to select a 
sample is a rather cumbersome and costly task—especially when 
the total number of elementary units is rather large. Then, too, a 
rather large hat would be needed for most practical problems. Thus, 
a more economical procedure is absolutely necessary. 

Fortunately, statisticians have developed such a procedure. It in¬ 
volves the use of something called random numbers . In this connec¬ 
tion the following quotation is pertinent: 

It was recognized by some workers early in the development of statis¬ 
tics that many of the methods which may reasonably be expected to 
yield random samples are, in fact, biassed. The evidence which has since 
accumulated supports that view. It seems that wherever any human ele¬ 
ment of choice is allowed free play, as, for instance, when an observer 
selects “at random” by eye a number of plants in a field, or draws cards 
“haphazard ly” from a pack, bias inevitably creeps in. There may be in¬ 
dividuals whose psychological processes are so finely balanced that they 
can deliberately choose random samples; but they are the exception, not 
the rule, and the ordinary statistician dare not use his own nervous 
mechanism as a random selector. 

There is thus a need for an objective method of drawing random sam¬ 
ples, which form the basis of a large section of theoretical and practical 
statistical studies. In the search for a reliable method many devices have 
been tried, but those which offer the greatest a priori prospect of random¬ 
ness are unfortunately difficult to wield in practice . . . The method of 
Random Numbers is the most reliable and often the speediest method of 
drawing random samples . . . Other methods may be quicker and just 
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as adequate for particular [problems] . . . but no other method is known 
which can in practice be used without misgiving on any [problem]. 0 

Now, perhaps it is best if first we let some of these random num¬ 
bers speak for themselves. A few of them are given in Appendix C. 
This group represents a section from a much larger table, but they 
will suffice for our present studies. Look them over. 

The numbers you have observed are called random because they 
typify exactly what is meant by the word. They represent a thorough 
scrambling of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, and 9. No matter how 
you read them—sideways, down, upside down, or backwards—they 
still seem to have that random property. It seems that every digit is 
independent of the preceeding one, of the succeeding one, of the 
upper one, and of the lower one. These numbers are typical of what 
we mean by random phenomena. 

There are several tables of random numbers which are used ex¬ 
tensively in statistical practice. 7 Appendix C reproduces a portion of 
one such table. The digits given come from a very much larger table 
compiled for use in sampling studies by The Rand Corporation. 
Tables of random numbers have been produced in a variety of ways. 
The mathematical and technical details involved cannot be con¬ 
sidered here. Needless to say, one way of not constructing such a 
table is to write down digits as they come into your mind. Such a 
tabulation undoubtedly would be biased in several ways—for ex¬ 
ample, by a preponderance of your favorite digits or your lucky num¬ 
ber. 

How can we determine whether the numbers in a table of random 
digits are actually in a random order? One answer is that the random 
nature of such a table of digits can be determined empirically. First 
of all, there are various statistical tests that can be applied to see 
if all the digits occur about the same number of times, and that 2's 
do not have a habit of following 6's, etc. Secondly, there is always 
the pragmatic test—do these numbers serve the purpose we wish 
them to? Well, to answer this let's see how we might use them. 

In general, we wish to use random numbers to help us in select¬ 
ing probability samples. Suppose, then, we first consider how we 
would use them to select a simple random sample. Variations of this 

e M. G. Kendall and B. Babington Smith, “Tables of Random Sampling Num¬ 
bers,” Tracts for Computers No. XXIV (London: Cambridge University Press, 
1939), p. vii. 

7 The earliest published table (1927) was compiled by L. H. C. Tippett. From 
British Census Reports, Tippett selected 40,000 digits at random and combined 
them into 10,000 groups of 4 digits each. 
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procedure also serve for many other types of probability sampling. 

The first thing we must do in using random numbers is to eohsider 
the frame, that important ingredient of many statistical studies that 
we considered in Chapter 2. A frame is a working definition of a 
population. It identifies all of the elementary units representing the 
population of study in terms of a list or map or in some other con- 
Crete way. 

Given a frame, we must assign a unique serial number to each and 
every elementary unit. This identifies the elementary units numeri- 
cally. For example, if we were to sample from a list of 10 warehouses 
(A, B, . . . , I, J) we could assign A the serial number 1, B the 
serial number 2, . . . , I the serial number 9, and J the serial num- 
ber 0. 

One thing of importance to remember in assigning serial numbers 
is that they all should be different. Another is that they should all 
include the same number of digits. Thus, suppose we had a frame 
listing 678 warehouses. We would have to assign 678 different serial 
numbers to the elementary units. Since the largest serial number to 
be assigned is a 3-digit number, each serial number may be taken as 
a 3-digit number. Thus, we may assign the first warehouse the num¬ 
ber 001 (not plain 1 since it must have 3 digits), the second 002, 

. . . , the thirty-third 033, . . . , the one hundred and twelfth 112 
. . . , the last, 678. 

Once serial numbers are assigned to the elementary units included 
in the frame, we may use random numbers to select the elementary 
units to be included in a sample. For example, to select 5 ware¬ 
houses from the 678 noted above, we turn to our table of random 
numbers and read off 3-digit numbers. In practice, one usually will 
pick up where he left off the last time he used the table. However, 
since this is our first experience we may start at the beginning and, 
for convenience, read digits in groups of 3 down the table. Refer to 
Appendix C and observe that this means we read off 931, 282 218 
220, etc. 

These numbers identify certain serial numbers and, hence, certain 
warehouses. Thus, the warehouses corresponding to these random 
numbers are selected for our sample—those selected are observed 
for the characteristic of interest. How many warehouses should be 
selected? The first 5 warehouses we can identify. Why 5? Because 
we want a sample of 5. In general, if we want a simple random sam¬ 
ple of size n, we continue using our table of random numbers until 
we identify n different elementary units. 

'.WHliM UtifiM 
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However, as you may have asked: “What about that 931 above?” 
Since we had only 678 warehouses, no warehouse was assigned that 
serial number. We may skip any set of digits that has no elemen¬ 
tary unit associated with it. Furthermore, if any serial number comes 
up twice, we skip it the second time—we ordinarily should not want 
to include any elementary unit more than once in a sample. 

Although we have used our random number table from the top, 
since the numbers are in a completely random order we might have 
started anywhere at all in the table. However, as was noted, in prac¬ 
tice one usually should pick up where he left off; he makes note of 
where he finishes using the table for one sample and proceeds from 
there on his next sampling problem. 

Thus, the use of tables of random numbers to select a simple ran¬ 
dom sample (or some other type of probability sample) generally 
involves two distinct steps: (1) assigning serial numbers to every 
unit in the frame, and (2) selecting a sample of these units by refer¬ 
ence to the table of random digits. In various problems, of course, the 
ways in which these steps may be taken vary according to the cir¬ 
cumstances of the situation. 


Example 4.19 In defining and classifying jobs, pertinent informa¬ 
tion includes the proportion of time spent by present employees in 
various activities. One statistical application which arises in this 
connection is called work sampling. Work sampling involves the se¬ 
lection of times, at random, and observing the activities of the em¬ 
ployee under study at the times selected. 

The sample of times may be selected by the use of random num¬ 
bers. For example, suppose an employee’s activities from 8:00 a.m. 
to noon and 1:00 p.m. to 5:00 p.m. are to be sampled on a given day. 
First, each minute of time may be assigned a 3-digit serial number, 
perhaps as follows (note that the list of times provides a frame): 


Time 

8 : 00 - 8 : 59 . 

9 : 00 - 9 : 59 . 

10 : 00 - 10:59 

11 : 00 - 11:59 

1 : 00 - 1 : 59 . 

2 : 00 - 2 : 59 . 

3 : 00 - 3 : 59 . 

4 : 00 - 4 : 59 . 


Serial Number 
.. 800-859 
... 900-959 
... 000-059 
... 700-759 
... 100-159 
... 200-259 
... 300-359 
... 400-459 


Note also that the use of 000-059 for 10:00-10:59 and 700-759 for 
11:00-11:59 avoids 4-digit serial numbers and other 3-digit serial 
numbers already used. 

Random numbers then would be used to identify the times to be 
selected. For example, using a table of random numbers we may find 
the digits 413, 952, 886, etc. The 413 identifies 4:13 p.m. which is one 
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time for the sample; the 952 identifies 9:52 a.m. which is a second 
time for the sample; the 886 identifies nothing and is skipped; etc. 


Example 4.20 As was pointed out in an example in an earlier 
chapter, wool ordinarily is imported in highly compressed bales. Sup¬ 
pose then, that an importer, facing the decision problem of how to 
grade a bale, wishes to sample the wool to provide a basis for his 
choice of an alternative. How might we select such a sample? 

One solution is to subdivide (on paper) the bale into “cores” about 
4 inches in diameter and half the depth of the bale in length. Serial 
numbers then should be attached to each core. For example, the 2 
sides of the bale might be drawn as follows: 




BACK 
OF BALE 


Note that this diagram represents a frame and that each “core” 
is identified by a pair of serial numbers-—e.g., (6,10) is marked on 
the diagram above. A sample of “cores” would be selected by reading 
out pairs of random numbers from a table. Those points that fall in 
the sample would then be cored out and tested as a basis for the 
classification decision. 


In many accounting and auditing applications by contrast with 
the situations just described, the problem of assigning serial numbers 
is no problem at all. Many elementary units, such as checks, tickets, 
invoices, vouchers, etc., are numbered as part of regular accounting 
operations. When these prenumbered units represent a population 
of interest, the first task in selecting a probability sample is already 
solved, and the selection of the sample is greatly simplified. 

Is the use of random numbers in situations such as those we have 
covered in this section a good procedure for selecting probability 
samples? This question can be answered empirically as well as ab¬ 
stractly. If the repeated use of such tables shows us that the chances 
of serial numbers appearing are, for all intents and purposes, equal, 
then we may conclude that the use of random numbers is justified. 
Experience has taught statisticians that this, in fact, is true. Hence, 
the use of tables of random numbers, as we have sketched it, brings 
us very close to insuring the theoretical definition of simple random 
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sampling (or some other type of probability sampling) in practice. 
Furthermore, it usually is a rather simple way of achieving this goal. 

4.8 Other Types of Probability Samples 

In this text we shall concentrate on the use of simple random sam¬ 
pling as a means of obtaining partial information. One reason for 
this is its relative simplicity. Another is that this type of probability 
sampling is basic to an understanding of almost all other sampling 
procedures—many other methods merely are variants or combina¬ 
tions of simple random sampling. Lastly, simple random sampling 
itself is an important and frequently used sampling procedure. 

However, to point up that there are other types of probability 
samples, we will devote this section to a brief outline of four other 
methods: (a) stratified sampling, (5) cluster sampling, (c) double 
sampling, and (cf) sequential sampling. Most of the details of the 
theory and the exact applications of these methods are out of the 
range of this book. Thus, we should regard this survey only as a 
quick look at what the above terms refer to. 

All of the aforementioned methods of sampling are methods of 
probability sampling. That is, despite their differences, each pro¬ 
cedure is designed to give each and every elementary unit a known 
chance of appearing in a sample. Therefore, all should be drawn by 
using a randomizing mechanism such as a table of random numbers. 

Of course, all of these methods of sampling differ in one or more 
important respects. In any given problem, they may differ m their 
cost per unit sampled. Thus, some probability sampling methods are 
more expensive to use than others. Then, too, some methods may 
provide a more reliable procedure than others for some specific prob- 
lem—this means less risk. 

One statistical problem in the design of a sample study is to take 
account of these differences in cost and risk, and to determine and 
use the most economical probability sampling procedure available. 
Suppose now we briefly consider the potential usefulness of each of 
these four methods as compared to simple random sampling. 

Stratified sampling involves partitioning or stratifying a popula¬ 
tion into groups of elementary units and selecting a simple random 

sample from each group or stratum. 

A stratified sample is, obviously, not a simple random samp e. 
Note, for example, that with four elementary units, A, B, C, and D, 
simple random sampling gives every possible sample—AB, AC, AD, 
BC, BD, and CD—the same chance of being selected. However, sup- 
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pose that we divided the population into two strata: (A, B) and 
(C, D). If one unit were selected from each stratum, the only sam¬ 
ples with a chance of occurring are AC, AD, BC, and BD. Neither 
AB nor CD are possible with stratification. 

Among the principle reasons for stratification and, hence, for 
stratified sampling are (1) information for the various strata of the 
population, as well as the population as a whole, is needed; (2) strat¬ 
ified sampling may be more convenient than simple random sam¬ 
pling, as in the case of a survey being carried out over a wide geo¬ 
graphical area where different field offices could each supervise the 
sample study of each stratum; and (3) the use of stratification may 
reduce the risks of relying on sample information. 

Example 4.21 Suppose that your college wished to determine the 
extent to which various factors influenced students’ decisions to at¬ 
tend that particular school. The college might need such information 
in deciding on how to slant its promotional and public relations pro¬ 
gram and on how to modify its admissions policy. 

In selecting a sample of students from whom to obtain informa¬ 
tion, a stratified sampling procedure might be wise. For example, one 
basis for stratification would be freshman, sophomore, junior, and 
senior. Thus, if a total sample of 200 were to be selected, 50 might be 
picked from each class at random. Alternately, the population might 
be stratified by the criterion of whether or not the students were from 
outside of, or within, the college’s home state. Then, perhaps a simple 
random sample of 150 in-state and a simple random sample of 50 
out-of-state students would be chosen. Either or both of these criteria 
might be a useful basis .for stratification. If both were used, 8 strata 
would be formed and simple random samples selected from each. 

The advantage of stratification in such a problem might arise from 
the need for information about each stratum as well as about the 
population as a whole. Or, if not, it might arise because stratified 
sampling would involve a smaller risk than simple random sampling. 

Note, for example, that if the class-year basis of stratification 
were adopted, a sample of all freshman would be impossible (only 50 
could be freshmen). However, with simple random sampling such a 
sample has a small chance of occurring. Note that this would be dis- 
advantagous only if freshmen had different reasons for attending 
the school than other students. In such a case the sample would be 
unrepresentative in the characteristic of interest. The risk of this 
possibility is eliminated w T ith stratification. 

Generally speaking, whenever strata differ with respect to the 
characteristic of interest, stratified sampling involves less risk than 
simple random sampling. 

Stratified sampling often is confused with quota sampling (de¬ 
scribed in Example 4.15) because both involve subdividing the pop- 
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ulation into different groups. While it is true that they are similar, 
they differ in one important respect. The elementary units to be 
covered by a stratified sample are drawn at random; those covered 
by a quota sample are drawn by convenience. Furthermore, it is im¬ 
portant to remember that neither method guarantees a “good” sam¬ 
ple—both involve risks. However, under stratified sampling these 
risks may be measured in terms of probability, while under quota 
sampling there is no way of evaluating them. 

In addition, it may be noted that useful stratification, whatever 
its purpose, demands some a priori knowledge of the population to 
provide a basis for stratification. Generally speaking the problem of 
providing this basis is a nonstatistical one. 

Cluster sampling involves partitioning a population into groups or 
clusters of elementary units and selecting a simple random sample 
of clusters. 

For example, suppose we want to select a cluster sample of 40 
dwelling units from a population of 1,000. We might form 250 clusters 
of 4 neighboring homes in the population. A simple random sample 
of 10 clusters would give us a total sample of 40 dwelling units. 

You can see the main advantage of cluster sampling from this il¬ 
lustration. It is represented by the savings in time and money that 
accrue from the fact that some elementary units falling into our 
sample will be close together (those in the same cluster). However, 
this also points up a disadvantage in cluster sampling. Roughly 
speaking, neighboring elementary units may have similar character¬ 
istics. Hence, any cluster ordinarily is more or less homogeneous— 
it cannot be expected to provide as much different information about 
a population as, for example, four independently selected elemen¬ 
tary units. 

The methods of selecting probability samples that we have con¬ 
sidered up to now all have at least one characteristic in common— 
they are all fixed-size sampling methods. That is, the sample size is 
chosen a priori, and this number is selected. After, and only after, the 
whole sample is drawn are observations made and the appropriate 
action taken. 

However, some methods of probability sampling do not share this 
characteristic. For example, double sampling involves plans for se¬ 
lecting one’s sample in two stages. One sample is drawn, and depend¬ 
ing on what is observed, some action is taken or a second sample is 
selected. Of course, if a second sample is selected, action then must 
be taken. 
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Ordinarily a double sampling plan will require a smaller total sam¬ 
ple than simple random sampling to achieve the same reduction in 
uncertainty. This is its main advantage. Its relative complexity is 
one administrative disadvantage. 

Sequential sampling is an extension of the basic idea of double 
sampling. However, elementary units are selected and observed one 
by one and a decision on whether to take a certain action or continue 
sampling made after each observation. 

In conclusion, it must be pointed out that no matter what type of 
sampling procedure is to be used—simple random, sequential, or 
cluster, etc. one must recognize that risks are involved. Different 
probability sampling designs lead to different risks, but in all cases 
these risks may be measured in terms of probability (our next chap- 
ter's topic). 

4.9 You Should Now Know That 

After defining a decision problem statistically, one must determine 
whether any observations actually are necessary. 

If they are, information may be provided by a primary study- 
one conducted to provide a basis for decision on the particular prob- 
lem at hand. 

Some pertinent information may have been collected for some 
other purpose, yet it may be useful in one’s immediate problem. 
Such information is called secondary data. 

Secondary data may be internal or external, depending on the 
auspices under which they were collected. 

Pertinent information may be derived on the basis of a sample or 
a complete count. 

A sample is part of a population. Sampling is the process by which 
a sample is obtained. 

Five possible reasons for sample selection rather than a complete 
count of a population are (1) economy, (2) timeliness, (3) destruc¬ 
tive nature of a test, (4) accuracy, and (5) an infinite population. 

Decision making on the basis of sample information is risky. These 
risks can be evaluated objectively only if probability sampling is 
used. 

Probability sampling is a selection procedure in which every ele¬ 
mentary unit has an ascertainable chance of appearing in the sam¬ 
ple. 

Judgment samples are those for which elementary units are se¬ 
lected on the basis of an expert's knowledge of the field of inquiry. 
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Convenience samples are those for which elementary units are 
selected because they are readily available. 

One important type of probability sample is called a simple ran¬ 
dom sample. It is one which is drawn in such a manner that every 
possible sample of a given size has an equal chance of being selected. 

Probability samples are drawn rather easily through the proper 
use of random numbers. 

Other probability sampling procedures include stratified sampling, 
cluster sampling, double sampling, and sequential sampling. 

4.10 You Should Now Be Able to Solve 

1. A retailer has data, ranging back 2 years, on the number of sales 
that he has made daily. He also has data on the total dollar sales per 
day. Are these primary or secondary data? Do the data represent 
complete or sample information? Explain your answers. 

2. In many automobiles gears are shifted by means of push but¬ 
tons. Assume that an automobile manufacturer receives a shipment 
of push-button panels to be installed in cars. An incorrectly wired 
panel would result in a car going in reverse when the drive button is 
pushed, and vice versa. Of course, if an improperly wired panel is 
installed, it would be detected when the completed car is driven off 
the assembly line. 

Should the manufacturer check the panels in deciding whether or 
not to install them? 

If so, should he check the entire lot or a sample selected from the 
lot to decide whether or not to install the panels? 

3. In each of the following instances indicate whether you would 
suggest use of a convenience, judgment, or probability sample. Ex¬ 
plain the reasons for your choice: 

а) A manufacturer of an industrial product desires to prepare a ques¬ 
tionnaire to explore potential new uses for his product. To help de¬ 
sign the questionnaire, a sample of present users and nonusers of 
the product is to be interviewed, 

б) A company is to select 100 workers from its 3,000 employees to de¬ 
termine the attitude of its employees towards a recently introduced 
social activities program. It will use this information to decide on 
whether the program is operating smoothly and should be contin¬ 
ued, or whether changes should be made. 

c) The school districts of a state have been in difficult financial straits 
because of the rapid increase in pupil enrollments. The state s de¬ 
partment of education, therefore, decides to undertake a detailed 
study of the problems of five school districts. 
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4. In each of the following instances describe how you would select 
the required sample. (First determine the population of interest and 
a suitable frame.) 

a) A simple random sample of 50 from invoices for a given month that 
* are numbered from 2,463 to 3,981 consecutively. 

h) Samples of apples from trees to compare the effectiveness of differ¬ 
ent sprays. 

g) A sjmple random sample of entries in an accounts receivable ledger. 
The ledger contains 150 accounts with varying numbers of debit and 
credit entries and is to be checked for accuracy. 

d ) A sample of sea water at a beach to determine bacteria count before 
approval of the beach by the Board of Health for public use. 

5. Appraise the following methods of sample selection. 

a) The “inquiring reporter” device used by newspapers to determine 
opinions on public issues. 

h) The letters received by Congressmen as a guide to the attitude of 
their constituents in reference to a given bill. 

6. A company installs, on a trial basis, a new safety (accident pre¬ 
vention) program in its manufacturing plant. The month before the 
new program there were 15 accidents (per 1,000 workers). The first 
month of the new program there were only 10 accidents (per 1,000 
workers). Should the new program be continued? Why or why not? 

7. A time-and-motion-study man clocks a particular polishing op¬ 
eration 10 times. His results are (in minutes): .15, .13, .14, .16, .12, 
.15, .15, .14, .16, .15. Is this a probability sample? If so, what type of 
probability sample is it? What population is it from? 

8. The Bureau of Labor Statistics asks the prices monthly (or in 
some cities, quarterly) of certain items of food in connection with 
the Consumer Price Index. 

a) The food items are chosen on the basis of judgment. Why? 

b) The stores from which price quotations are taken are chosen at 
random. Why? 

4.11 You Will Also Find That 

Clear discussions of sources of secondary data are included in the 
following: 

Blair, Morris Myers. Elementary Statistics, chap. xxiv. Rev. eel. New 
York: Henry Holt & Co., Inc., 1952. 

Croxton, Frederick E., and Cowden, Dudley J. Applied General Sta¬ 
tistics, pp. 45-49. 2d ed. New York: Prentice-Hall, Inc., 1955. 
Stockton, John R. Business Statistics, chap. iv. Cincinnati, Ohio: 
South-western Publishing Co., 1958. 
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Pertinent references on sampling, the reasons for sampling, types 
of sampling procedures, and how to sample include the following: 

Mandel, B. J. Statistics for Management, pp. 150-79. Baltimore, Mary- 
land: Dangary Publishing Co., 1956. 

Neiswanger, William A. Elementavy Statistical Methods, pp. 105—44. 

Rev. ed. New York: Macmillan Co., 1956. 

Neter, John, and Washerman, William. Fundamental Statistics for 
Business and Economics, pp. 267-81. New York: Allyn & Bacon, Inc., 
1956. 
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Probability Theory 


5.1 Statistics and Statistics and Statistics 

We began our study of statistics by studying the nature of busi¬ 
ness problems and by considering how to translate some of these 
problems into statistical terms-i.e., by defining the population and 
by selecting the decision-parameter(s). We then considered the fact 
that many decisions must be made on the basis of sample studies 
since complete information is not always available, or may be too 
costly or time consuming to obtain. Such studies involve risks, 
therefore, a statistician ordinarily insists on a probability sample as 
a basis for decision because only then can these risks be evaluated 

samples 6 ^ ^ ^ Chapter we learned how to select probability 

So now we have business problems, on the one hand, and knowl¬ 
edge of how to select probability samples, on the other. Our next 
ask is to shake hands with ourselves; that is, to co-ordinate these 
two aspects of our subject matter. Perhaps the basic questions in 
w ich we are interested can be summed up by saying: “We know 
how to draw probability samples-but what can we do with them? 
How can they help us m making decisions—how can they help us 
m solving managerial problems?” 

• Fl , rSt j° f f U We mUSt realize that in sample studies, despite the risks 
involved, decisions must be made on the basis of information con¬ 
tained m the sample. Ordinarily, however, a sample will contain 
some information which is not relevant to making a decision. For 
example, when a sample of screws is taken from a lot and their di¬ 
ameters measured, we ordinarily are not interested in the array of 
these diameters. We are interested in the sample average and/or the 
sample variance, or the proportion falling outside of specifications 
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-but only for what they indicate about the pertinent characteristics 
of population. For instance, the sample average is important for 
what it can tell us about the population average. 

Thus, the sample information which is relevant to the decision 
parameter(s) of interest is what concerns us, and this information 
usually can be summed up in such measures as the sample propor¬ 
tion, or the sample mean, or the sample standard deviation These 
measures are called statistics. Thus, we have a third use for this 
term. Previously, we have spoken about the fact that statistics are 
data, and that statistics is the field we are studying Now we shall 
speak of a statistic as a measure that summarizes information from 

a Youmight have noticed that the aforementioned statistics have 
names similar to some decision-parameters. These statistics and the 
corresponding decision-parameters are almost identical insofar as 
their definitions or formulas are concerned. The mam difference be¬ 
tween the two types of measures—and it is an important one is 
that a decision-parameter summarizes information about a popula¬ 
tion while a statistic only summarizes information given m a sample. 

Since we will be concerned with sample observations and statistics, 
it will be convenient to represent them by some generalized notation 
We will continue to denote the size of a sample by n and the size of 
a population by N. In addition, the various individual observations 
in our sample will be represented by 


Xn 


Xi, X2, Xg } . . 

The represents the first observation in our sample, z 2 the second 
observation in our sample, etc. The *. is, of course, the last observa¬ 
tion. In general, x t is the ith observation m our sample 

Notice that in this notation we are using (small letter) * as a 
symbol for an observation. Previously, in Chapter 3, we used (cap¬ 
ital) X in a similar connection. However, at that time we were talk¬ 
ing about observations in a population (X t was the first observation 
in the population, etc.). These values were constants—properties of 
the elementary units under consideration. Now the story is entirely 
different. The observations that fall into a sample are variables, and 
to emphasize this difference we use small letters. The value of any x 
can be any one of the N population values, depending upon which 
one of all the possible samples is selected and upon the order m whic 
the items are selected. Furthermore, since we are to select probabi - 
ity samples, the sample observations are obtained by means of a 
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random process and, therefore, they are called random variables . The 
use of x thus connotes the fact that each value appearing in our sam¬ 
ple is a result of chance selection. 

We used Greek letters as symbols for summary population meas¬ 
ures—decision-parameters. We now will use Latin letters as symbols 
for summary sample measures—statistics. Thus, the sample mean 
will be denoted by x (read rr-bar) and expressed in a formula as 


2 # • 

^ n (f = 1, 2, . . . , n) . 

You should remember that 3 (capital sigma) is the summation sign 
with start and stop instructions stated at the right. Thus, 

2Xi (* = 1 , 2 , . . . , n ) . 

means Xl + x 2 + . . . +x n ; namely, the sum of all n variates which 
fall into our sample. Thus, x, the sample mean, is defined as this sum 
divided by n. 

The sample variance will be denoted by s 2 with the formula 


g 2 = 3p 2 

n 


(i = 1, 2, ... , n) . 


Furthermore, the sample standard deviation (the square root of 
the sample variance) will be denoted by s with the formula 


= fc( Xi ~~ ^) 2 

\ n 


(i = 1, 2, . . . , n ) . 


The sample mean and sample standard deviation are measures 
derived from samples where the observations are variates. When the 
observations are attributes, the sample proportion often is the sta¬ 
tistic of interest. The sample proportion will be called p. It is com¬ 
puted in the same manner as n, though p is a statistic while tt is a 
decision-parameter. For example, suppose that a shipment of resis¬ 
tors were received by a manufacturer. If a simple random sample of 
25 resistors were selected and tested, and 2 were found defective, the 
proportion defective in the sample would be 


The definitions of the sample measures x, s 2 or s, and p are simi¬ 
lar to those of their counterparts in the population, a 2 or <r, and tt 
respectively. It is important to recognize, however, that the simi¬ 
larity is only in form since /», o- 2 or <r, and n always refer to a given 
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population. They are properties of that population and have only 
one value—they are invariant. Their values remain the same 
whether we know them or not. They are decision-parameters, and 
in no way depend on the sample we select. 

However, x, s 2 or s, and p always refer to a sample. They are sum¬ 
mary measures of the information in the sample. If we were to draw 
samples of the same size over and over again from the same popu¬ 
lation, x, s 2 or s, and p would vary from sample to sample. Their 
values depend on which sample of all possible samples is selected. 
Statistics, therefore, are variables. 

Nevertheless, since in many problems we are uncertain about the 
exact value of a decision-parameter, we may be forced to rely on the 
corresponding statistic as a criterion for decision. This is risky be¬ 
cause statistics are variables. However, when they are computed 
from probability samples, they, like the individual observations, are 
random variables—and the way in which they vary can be predicted. 
Let’s consider the implications of the use of a statistic instead of a 
decision-parameter as a basis for decision by way of an illustration. 

Example 5.1 A soap company must decide what proportion of its 
current output should be devoted to a more expensive detergent than 
its present product. If it knew tt, the proportion of its customers who 
would buy the new product, it could make its decision. For example, 
suppose the firm plans to produce immediately 10,000 boxes of de¬ 
tergent If it knew tt, it would produce 10,000tt boxes of the new 
product and 10,000(1 - tt) boxes of the old detergent. (Note that 
l - iris, by definition, the proportion of customers in the population 
who say they would buy the old detergent.) 

In the absence of an economical procedure for obtaining complete 
knowledge about tt, the company might draw a sample of customers. 

A reasonable statistic summarizing pertinent information about this 
sample would be p, the sample proportion of customers who would 
buy the new detergent. 

One statistical decision rule for the company to use m its problem, 
then, would be to produce 10 , 000 p boxes of the new product and 10 ,- 
000(1 _ p) boxes of the old product. That is, in taking action the 
firm might use p in place of tt, although in doing so they would be 
taking some risk. 

There are many possible values of p that could be obtained in a 
situation such as that in the preceding example. The actual value of 
p obtained depends upon which one of all possible samples is se¬ 
lected. Hence, there are risks involved in taking action based upon 
such a statistic. However, if there were some way of determining the 
pattern of variability of p, we would have an indication of the pos- 
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sible risk involved in utilizing the sample proportion (a statistic) 
rather than the population proportion (a decision-parameter) as 
our criterion for decision. Thus, roughly speaking, if we determined 
that 95 per cent of all possible sample proportions ranged from 40 
per cent to 60 per cent, we would know that the risk is greater than 
if 95 per cent of the possible sample proportions ranged from 48 to 
52 per cent. In general, we know that the greater the variability in 
the possible values that a statistic can attain, the greater the risk in¬ 
volved in the use of the statistic as a criterion for decision. 

Fortunately, the mathematical theory of probability enables us to 
determine the pattern of variability of random variables. But sta¬ 
tistics are random variables since they summarize observations that 
are obtained by a random process. Therefore, probability theory 
will enable us to determine the pattern of the variability of statistics 
and, hence, to measure sampling risks. 

In the remainder of this chapter, we shall deal with the elements 
of probability theory and see how this theory can be helpful in meas¬ 
uring the risks involved in basing decisions on sample information. 

5.2 Mathematical Probability 

Probability, chance, and likely are three words which are quite 
common in everyday conversation. We have all made, or at least 
heard, statements such as: the probability of a head when a coin is 
flipped is one half; the likelihood that there is life on Mars is low; 
the chances that the Yankees or the White Sox will win the pennant 
next year are very high; and the probability of holding four aces in a 
game of poker is less than the probability of holding a royal flush. 

In which of these connections or in what other situations haye you 
expressed similar ideas? What did you mean by them? It will be 
advantageous for you to give a few minutes of thought to these ques¬ 
tions. Later you can check your intuitive notions against the more 
formal development given below. But for the present, your intuition 
should help you to appreciate some of our introductory remarks. 

Our present aim is not to consider the term “probability” and its 
synonyms in their everyday context. Rather it is to give these ideas 
a technical, rigorous, and specific meaning. Thus, succeeding sections 
present some of the basic principles of mathematical probability as 
they have been laid down by mathematicians and statisticians over 
the past two or three centuries. This type of probability may not 
conform exactly to your intuitive feeling for the term as it is used in 
everyday conversation, but it should be somew T hat similar. In any 
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event, our short survey will involve, initially, the principles of the 
theory of probability as a branch of mathematics. Secondly, it will 
involve the applications of the results of mathematical probability 
in the field of statistics. 

Incidentally, the difference between mathematics and statistics is 
borne out in the last two sentences. In short, the development of the 
theory of probability is mathematics, while one of the applications 
of this theory is statistics. The distinction may be clearer if we draw 
an analogy with geometry. This subject, as you may know, is de¬ 
veloped mathematically; that is, its development hinges on mathe¬ 
matical principles: reasoning from axioms and postulates. The task 
of applying the results of this development to problems in the real 
world then falls to the engineer, the architect, the astronomer, or 
others in applied fields. In exactly the same way it is the statistician 
who may apply the mathematically derived results of probability 
theory. Nevertheless, it must be remembered that the foundation of 
statistics and statistical applications is the mathematical theory of 
probability. 

As we have mentioned, mathematicians and statisticians like to 
give a more precise and definite meaning to probability than do most 
people in everyday conversation. What, then, is their definition of 
probability? This is an easier question to ask than to answer in an 
elementary textbook. We might hint at the difficulty by saying that 
there are many ideas on how probability should be defined. There is, 
however, one point of almost universal agreement. Almost everyone 
in the field of mathematics and statistics agrees that mathematical 
probability should be expressed quantitatively. Thus, their attention 
is centered upon a numerical measure of probability, or as it is some¬ 
times called, a probability measure. 

The setting for our problem is paralleled by a very familiar phe¬ 
nomenon, the weather. In everyday conversation, we often use the 
words “hot,” “cold,” “warm,” “cool,” “pleasant,” etc. These have 
only a very general qualitative meaning. On the other hand, we are 
able to measure temperature on either a centigrade or a Fahrenheit 
thermometer. We are able to give temperature a quantitative mean¬ 
ing. This is precisely what we want to do with probability—to meas¬ 
ure it on a scale. Thus, we shall interpret our probability measure 
as follows: 

The probability of an outcome, A, is defined as the proportion of 
times A would occur in an infinite series of repeated trials of the 
same kind. 
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If we let A be some specified outcome in a random experiment, the 
symbol P(A) will refer to the probability that A will occur. Then 
our definition may be expressed in a formula as 

7Y .1) __ Number of Times A Occurs 

Number op Repeated Trials op the Same Kind 

Thus, when we talk about the probability of an outcome, we mean 
the relative frequency with which it would occur if the experiment 
were repeated an infinite number of times. For example, we defined 
simple random sampling as a method of selection which gives every 
possible sample of size n the same chance of being chosen. What 
does this mean? It means that to select such a sample we must use a 
method which if repeated indefinitely would result in each combina¬ 
tion appearing with the same relative frequency. 

There are two aspects of the definition of probability that we 
must discuss: first, the notion of “infinite series” and “long run”; 
and, secondly, the idea of “repeated trials of the same kind.” 

The idea of “infinite series” and “long run” may be cleared up by 
a conventional illustration. In terms of our definition, what would 
we mean if we said that the probability of a coin falling heads when 
it is flipped is equal to .50? We mean that if this same process, flip¬ 
ping the coin, were continued indefinitely under the same conditions, 
we would expect 50 per cent of the outcomes to be heads. 

We should also point out what the .50 does not mean. It does not 
necessarily imply that as we continue tossing the coin, we will come 
to a stage when the number of heads will always be equal to the num¬ 
ber of tails. In fact, the difference between the number of heads and 
number of tails may very well get larger as we increase the number 
of flips. Thus, in 100 tosses of the coin we might obtain 60 heads and 
40 tails, or an excess of 20 heads. After 1,000,000 tosses, we might 
have 500,500 heads and 499,500 tails, or an excess of 1,000 heads. 
The relative frequency of heads, however, has decreased from .60 
to .5005—very close to our seemingly “good” estimate that the 
probability of a head is .50. 

How do we know that the probability of a head is .50 when we flip 
a coin? The answer to this question is that we don’t know—50 per 
cent is only a conjecture based on the apparent symmetry of the 
coin. We would know the true probability exactly only after ob¬ 
serving an unlimited number of flips of the coin. This obviously is 
impossible. A little thought, in fact, will lead you to the conclusion 
that we can never determine any probability exactly. To do so would 
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require an infinity of observations! What can be done so that we can 
ease ourselves out of this quandary? Fortunately, even though we 
can never know any probabilities exactly, there are various proce¬ 
dures for estimating them. 

One method of estimating probabilities is on the basis of past ex¬ 
perience. Two examples will illustrate this idea. 

Example 5.2 Suppose that an office manager wishes to determine 
the probability of a paper being misfiled by his file clerks. He ex¬ 
amines 1,000 papers filed during the past week and finds 30 misfiled 
papers. On the assumption that the process of filing or misfiling papers 
is a random process, an estimate of the probability of a given paper 
being misfiled, P{A), is 


P(A) = 


30 

1,000 


.03 . 


This value indicates that in the long run approximately .03 of the 
papers will be misfiled. Although the office manager cannot predict 
which individual papers will be misfiled on the basis of this infor¬ 
mation, the estimate does give him some insight into the efficiency of 
the filing process. 


Example 5.8 In setting their rates, life insurance companies make 
use of mortality tables which give them the probability of a person 
dying during any one year of his life. These tables are based on past 
experience, gained by keeping records on ages at which persons die. 

For example, one mortality table (there are several, but all give 
similar probabilities) lists the probability of a 21-year-old person 
living to 22 as approximately .992. Since he can only live or die, the 
probability of his dying before reaching 22 is approximately .008. 

Here again the process is viewed as a random process. Probabilities 
such as these, of course, do not give probabilities strictly applicable 
to any one individual, who might be rather sick or very, very 
healthy. The probabilities refer to the population at large. 

The second method of estimating probabilities is one based on the 
nature of the process we are observing. For example, when a coin is 
flipped, there are two possible outcomes, heads and tails; when a die 
is thrown, there are six possibilities, 1, 2, 3, 4, 5, and 6. Ordinarily, 
coins and dice ar e symmetrical. Unless we have reason to believe 
otherwise, it seems logical to assume that each possible outcome is 
equally likely or equiprobable. Thus, if a die is thrown, what is the 
probability a 4 shows? Using our equiprobable assumption, P(4) 
equals % since there is only one way a 4 can occur and six different 
outcomes are possible. 
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As an illustration of a statistical application of the equiprobable 
assumption, reconsider the use of random number tables for select¬ 
ing probability samples. Because of the nature of these tables and 
* the ways in which they are derived, it is reasonable to assume that 
every digit, 0, 1, 2, . . . , 9 is equally likely in such a table. That 
is, we may assume that the probability of any digit occurring is Yio- 
Likewise, we may assume that every pair of digits, 00, 01, 02, ... 
99, is equally likely—that is, each has a probability of %oo of oc¬ 
curring. 

The equiprobable rule for estimating probabilities must be used 
with care. For example, just because you can pass or fail your course 
in statistics, you wouldn't care to say that both outcomes are equi¬ 
probable. 

We now turn to the second idea in our definition—the matter of 
repeated trials of the same kind. This requires some qualification 
because we have assumed that trials can be repeated. However, a 
man can only live or die once during his 21st year; in testing a given 
light bulb it will or won't burn out in its first 100 hours of use; etc. 
These experiments cannot be repeated. This presents a problem 
when we attempt to estimate a probability on the basis of past ex¬ 
perience because if a trial cannot be repeated we wind up with very 
little experience. 

The problem is usually resolved by pooling many trials of the 
same kind. To estimate probabilities of living and dying we observe 
not one individual but many individuals of the same kind (although 
no two individuals are the same, we group those with the same age, 
occupation, race, etc.). When we flip a coin repeatedly, we assume 
that the flips are of the same kind (although they may not be ex¬ 
actly so; for example, the coin wears and/or your finger gets cal¬ 
loused). 

No trial is exactly repeatable; no two trials are of exactly the 
same kind in any business situation. Thus, in determining the prob¬ 
ability of a misfiled paper in Example 5.2, the trials might not have 
been of the same kind. The file clerks might have changed, the light¬ 
ing system could have been improved, or a new supervisor might 
have been appointed. All these factors might result in trials not of the 
same kind. The best we can do is try to lump together those that are 
approximately of the same kind when we must estimate a prob¬ 
ability. But even barring this remedy, our definition of probability 
still provides us with a theoretical bench mark. Remember, then, 
that P(A) means the proportion of times A would occur if the ex- 
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periment were repeated an unlimited number of times under the 
same conditions. 

5.3 Properties of a Probability Measure 

Now, we turn to discuss three basic properties of a probability 
measure. These properties follow almost immediately from our defi¬ 
nition of probability. 

First of all, what are the numerical limits of a probability meas¬ 
ure? What are its minimum and maximum values? The answer is 
that for any outcome the probability measure may have a value 
as small as 0 or as large as 1. 

These extreme values of 0 and 1 have a convenient interpreta¬ 
tion. Thus, if an outcome never occurs, no matter how many trials 
are attempted, the probability is considered 0. This is reasonable in 
view of our interpretation of probability; if an event never occurs 
then its relative frequency is 0. Furthermore, if one outcome always 
results from an actual experiment, the probability of this outcome 
may be considered as equal to 1. This, too, is consistent with the 
relative frequency definition of probability, as you easily can verify. 

For example, on the basis of past experience we might estimate 
the probability of a baby being born with green hair as 0. On the 
other hand, we might also say that the probability that every person 
alive today will die at some time is 1. 

A second property of a probability measure is that the sum of the 
probabilities of all possible outcomes is equal to 1. For example, sup¬ 
pose that we select at random one entry from an accounts payable 
ledger in order to check its accuracy. The two possible outcomes of 
this experiment are the selection of an incorrect entry and the selec¬ 
tion of a correct entry. Therefore, the probability of an incorrect en¬ 
try plus the probability of a correct entry must equal 1. Thus, if the 
probability of selecting a correct entry is .8, the probability of select¬ 
ing an incorrect entry must be .2, since they are the only possible out¬ 
comes. Furthermore, suppose we were to select two entries from the 
accounts payable ledger. Then 0, 1, or 2 of these may be incorrect. 
Whatever the probability of each of these outcomes, their sum must 
be 1. 

The third and last property of a probability measure refers to the 
likelihood that one or the other of two possible outcomes occurs. We 
will consider this property only as it applies to outcomes that are 
mutually exclusive . Mutually exclusive outcomes are those which 
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cannot occur at the same time. Some illustrations should fix the idea 
in your mind. 


Example 5.4 A personnel administrator must decide on one of 
several applicants for a position with his company. His alternatives 
which include hiring J. Smith, H. Jones, M. Jackson, etc., are mutu¬ 
ally exclusive since he is to hire only one of the applicants. 

A worker may be absent exactly 0, 1, 2, 3, 4, or 5 days in a given 
week. All of these possibilities are mutually exclusive. Thus, for ex¬ 
ample, if he is absent exactly twice, he cannot be absent ’exactly 
once. J 


A simple random sample of n students is to be selected and their 
average age calculated. Among the possible outcomes of this random 
experiment are x - 22.1 years and x = 20.5 years—two outcomes 
which are mutually exclusive. If the sample mean is 22 1 years it 
cannot be 20.5 years. y ’ 

,. r “ do “ * a ™P ]e of 2 machines from a group of 6 machines 

(A, B, C, D, E, and F) is selected. If the sample AB occurs, AC can¬ 
not occur nor can AD, AE, etc. All of these outcomes are mutually 


The rule expressing the probability of mutually exclusive out¬ 
comes may be stated as follows: The probability of either of two 
mutually exclusive outcomes occurring is equal to the sum of their 
individual probabilities. If we write P(A or B ) for “the probability 
of either A or B,” we can summarize this rule as 


P(A or B) = P(A) + P(B ) . 

This rule is sometimes called the addition rule. We must remember 

that this rule is valid only if the two outcomes are mutually exclu¬ 
sive. 

This rule may be extended to three or more mutually exclusive 
outcomes as well. Thus, if A, B, C, etc., are all mutually exclusive: 

P(A or B or C or . . . ) = P(A) + P(B) + P(C) + . . .. 

Example 5.5 Suppose that the probabilities of exactly 0 12 34 
etc., machines being out of order at the same time in a plant have’ 
been estimated on the basis of past experience. Let P(0) be the 
probability of 0 machines being out of order, P(l) the probability of 
1 machine, etc. Then the information can be summarized as follows: 

P(0) = .449 
P(l) = .360 
P( 2) = .144 
P(3) = .038 
P(4) = .008 
P(5 or more) = .001 . 
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What is the probability of exactly 0 or exactly 1 machine being 
out of order at the same time? Since 0 and 1 are mutually exclusive 
outcomes, the addition formula can be applied. 

P(0 or 1) = P(0) + P(l) = -449 + .360 = .809 . 

What is the probability of 2 or less machines being out of order? 
Similar reasoning shows that 

P(0 or 1 or 2) = P(0) + P(l) + F(2) 

= .449 + .360 + .144 
= .953 . 

You will recall that the addition rule is not applicable when the 
outcomes are not mutually exclusive. For example, if you were to 
select a student at random from your statistics class, the probability 
of selecting a junior or a student over 21 years of age does not refer 
to mutually exclusive outcomes. A student may be both a junior and 
over 21 years at the same time; one outcome does not exclude the 
other. Other simple rules have been developed to cover the calcula¬ 
tions of probabilities for nonmutually exclusive outcomes, but we 
shall not consider them in this text. 

5.4 Repeated Trials—Conditional Probability 

Up to this point we have dealt with probabilities of outcomes in 
a single random experiment or trial. For example, we spoke of the 
probability of selecting a correct entry in an accounts payable 
ledger, and of the probability of 1 or 2 machines being out of order 
at one time. In this section, we shall determine how to compute the 
probability of a given sequence of outcomes in repeated experi- 
ments or trials. 

Example 5.6 A coin is tossed 3 successive times—these represent 
3 repeated trials. One question dealing with these trials that might 
be of interest is: “What is the probability of the sequence head- 

head-head ?” . . . ,. 

An employee is to report to work 5 days m a given week—this 
situation presents 5 repeated trials. Each trial may result in the 
worker being absent or present. The type of question we will attempt 
to answer in this section is exemplified by “What is the^ probability 
of the sequence absent-absent-present-present-present?” 

Three machines are observed for a week to determine whether they 
require maintenance work or not. Each machine represents 1 trial 
the 3 machines represent 3 trials. 

In developing rules for computing probabilities of a given se¬ 
quence of outcomes from a series of repeated trials, there are two 
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basic ideas to keep in mind. First of all, we desire rules for express- 

ahiliTp Pr f 0ba f r ht ;f °f a sequence of outcomes in terms of the prob¬ 
abilities of individual outcomes. This situation is similar to the one 

tTrms e of pfi) ; e d £ J Cti0n it 3 Wh6re ° r £) Was lessee Z 

terms of P(4) andP(P)—as their sum. Thus, to compute probabili- 
ties of sequences we will have to know individual probabilities in or¬ 
der to apply the rules to be developed 

trifrand 1 to de 77 t0 77^ betWeen tW ° of "Peated 

trials and to develop two different rules for these types Thus re 
peated trials may b e dependent or independent, dipending upon 
w ether any.one trial influences the others or not. For example two 
successive flips of a coin should not influence each 
ey are independent. On the other hand, suppose that we were to 
draw two cards from a deck. The outcome of the first draw would 

drfw'n firsHt 7 (e ' g ‘’ if thc aCe ° f cIubs were 

be dependent d aWn SC0 ° nd) ' Thus - these trials would 

In this section, we shall develop a rule for computing the proba¬ 
bility of a sequence of outcomes when the trials are dependent- 
later we shall consider independent trials. Thus, the rule for calcu¬ 
lating the probability of a specified sequence of two outcomes in two 
dependent trials states: The probability of a sequence 7Zoout- 
comes is equal to the probability of the first outcome multiplied by 
e probability of the second outcome given that the first outcome 
hns occurred. You will observe that the probability of the second 
outcome is conditional upon the occurrence of the first outcome 

probability - The ruIe can be stated 
P(A and B) = P(A) X P{B\A) . 

The vertical line in (5|A) stands for the word “given.” Let us look 
at an illustration of the rule. S 100JC 

Example 5.7 Two forms of a general achievement test are to be 
se ected at random rom a group of 6. These teste are to be adm7 s ! 
tered to entering college freshmen. The forms are numbered 1 throueh 

le£ C H 1U?1 7 PTObability that forms 2 and 4 will be st- 

cted m that ordert When we select the first form of the test we 

ITflr *9-7 t0 d ? W fr T' U We USe random numbers, the probability 
of form 2 being drawn first is %. However if form 2 is 

next test must be drawn from the remaining 5. The probability of 

drawing form 4 is The drawing of form 4 L the secoKettSn is 
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conditional upon the drawing of form 2 first. The probability that 
forms 2 and 4 will be drawn in that older is 

P(2 and 4) = P(2) X P(4|2) 

= W 


This answer may be verified by enumerating the set of all possible 
sequences of two tests as follows. 


-7- 

When a random method of selection is used, each of these 30 dif¬ 
ferent sequences is equally likely. Form 2 followed by for “ * 
sents only 1 of the 30 possible outcomes, and, hence, its P robab ? 

TS— i. v *> ue TS, S.™ 

the application of the conditional probability rule. 

The conditional probability rule can be generalized for more than 
two outcomes as follows: 

P(A AND B AND C AND . . . ) = P(A) X P(B\A) X P(C\A AND B) X , 

where P(C\A and B) refers to the probability of C given that A and 
B have occurred, etc. Thus the probability of a 
comes in repeated random trials is equal to the product of the prob 
abilities conditional upon the occurrence of the preceding outcomes 
to fc“ q u».=. For example, .eee.sider the eituaf on desmMrn 
Example 5.7 and assume 3 test forms are to be selected. What is the 
probability that forms 2, 4, and 5 would be selected t» that 
The probability of selecting form 2 first is and the probability 
of selecting form 4 second, given that form 2 has been selected firs , 
is i] In addition, the probability of selecting form 5 third, given 
that forms 2 and 4 have already been drawn, is %. Hence, the prob¬ 
ability of forms 2, 4, and 5 being chosen in that order is 


P (2 AND 4 AND 5) 


i X I X I = J_ 

g X 5 X 4 120 


The conditional probability rule for two outcomes and its exten¬ 
sion for three or more outcomes are quite useful in the development 
of statistical theory. In Chapter 4 it was pointed out hattojdect 
o simnle random sample it is not necessary to enumerate all possible 
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combinations of elementary units and select one of these at random 
An alternative procedure was suggested—randomly select n elemen- 
tary units one at a time. We have progressed sufficiently in our study 
of probability theory to indicate the validity of this statement 

Example 5.8 Suppose we consider a population represented bv 
elementary units which may be denoted by A, B, C, D and E Anv 
procedure for selecting a simple random sample of 2 must give each 
possible combination of 2 elementary units 


AB 

AC 


AD 

AE 


BC 

BD 


BE 

CD 


CE 

DE 


the same chance of being selected. That is, each of these 10 possible 
samples must have a probability of Vl0 of being selected ? 

buppose, then, we were to select 1 elementary unit from the podu 
lation at random and follow this by a second random selectio/from 
the remaining 4 elementary units. Does this method give every pos¬ 
sible sample of 2 the same chance (namely, y l0 ) of being drawn? 

Let us answer this question for the sample represented by BD 
Thus, consider the probability that unite B and D will be selected in 
any order. We must consider the possible order D and then B as well 
as the order B and then D. The probability of D and then B £ by th e 
conditional probability rule, J y 


~ X \ 
5 4 


20 


But,^ the same reasoning, the probability of B and then D also 

These two sequences are mutually exclusive outcomes for they can- 

Th “' the ptob ‘ bi,ity - - - 

J_ , 1 

20 _r 20 20 _ 10 ’ 

This is the probability of the sample B and D occurring in any order 

Se alTi” 7 ^ may Sh ° W th<3 Pr ° babilit y of “7 s ^h sam- 

sewfnv |' e ™ lts , indicate that method suggested in Chapter 4 for 
selecting a simple random sample by randomly drawing elementary 
units one at a time is valid. It is not necessary to enumerate all possh 
ble samples and select one of these at random. 

nrI°K-r’ m n °, te th f f this last illl,strati011 both the conditional 
p a llity rule and the mutually exclusive rule were used Since 
these rules sometimes are confused with each other, a comment on 
their applicability is in order. The conditional probability rule is 
applied to determine the chance that some particular sequence of 
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outcomes (e.g., B and D) occurs m a series of trials. ^ mutuaUy 
exclusive rule is applied in such a situation only to det ®™ me the _ 
probability of one or another of two sequences (BD or DB) occur 
ring Though both rules often are used in the same problem, remem¬ 
ber that they are applied to answer different questions. 

Just as we used probability theory to indicate the justification of 
one statistical sampling procedure in Example 5.8, we may use it to 
point up the validity of other statements made during our study o 
sample selection. For one thing, it was noted that several methods 
of probability sampling give each elementary unit the same 
of being chosen—that simple random sampling is not the o y 
procedure with this property. 

Fxanwle 5.9 Assume that we have a population represented by 
500 elementary units—for example, 500 index cards in a file. • e wis 
random sample of 2 cards What is the probability 

that a particular card will be drawn into the sample. 
h Any one card may appear as the first item selected or as the sec- 
nnd item selected. The probability it appears first is Vsoo because 
there is a total of 500 cards. Any one card may be the second item 
selected only if another card is selected first. By the conditional 
probability rule, the probability of this sequence is 

499 1 _ _L . 

500 X 499 500 

Note that the 49 %oo represents the chance of a given unit not being 
selectedon the first draw" while the y 499 is the probability of it being 
selected on the second draw, given that it was not selected on the first 

dr Thus the probability that a card is drawn first is y 5 „o. and t! f 
Drobability that it is drawn second is y 5 oo- Since these are mu u y 
^“ outcomes, the probability that a card will be drawn first or 

second is 

1 i _L = — . 

500 ^ 500 500 

You may notice that this is the ratio of the sample size (n = 2) to 
to the*population size (N = 500). In general, it can be shown that 
with simple random sampling each and every elementary unit has a 
probability of n/N of appearing m the sample selected. 

Examvle 5.10 Cluster sampling was defined in the last chapter as 
a method of selection which involves dividing a populationi into 
clusters of elementary units and selecting a simple random sample 
of clusters. 
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For example, suppose cluster sampling is to be used in a 

2 800 t0 obtain ^formation on automobile part sales for a state’s 
2,800 gasoline stations. The 2,800 elementary units (gasoline sta 

tlZeTthePT 5U?6d , int ° 5 / tati0nS PCr ° luster ’ Uniting ^ S 
i + j ‘■’■ Then, if a simple random sample of 100 clusters were se 
lected, this would represent 500 gasoline stations ^ 

since the sample of 100 clusters from the total of 560 is tn ho o 
simple random one, the probability of a particular isteVbdng se 
ected in the sample is 100/560. In general, the probability of a clus- 
emg selected m a cluster sample is equal to the ratio of the 
number of clusters in the sample to the total number of clusters 

ter 1 W dltWn ’ i eac r ° f the 2)800 gas st ations is part of one clus- 

Thiis ! nCe h ea ^ h ° f the , lr P robabllltie s of being selected is 100/560 
Thus each and every elementary unit has the same chance of being 

S ! SltLitog““ plint Thi ’ pro|, * rtr “ “* “W- »»• 

You will notice that from Examples 5.9 and 5.10 that both simple 
random sampling and cluster sampling are methods of selection 
which give each elementary unit a known chance of appearing in a 
sample and that probability theory enables us to determine these 
chances. While these chances are all the same in the preceding il- 
lus rations, it is important to recognize that for some types of prob¬ 
ability sampling these chances at times are unequal. However equal 

s°amp7e q ’ 7 ^ * B0WB f ° r the ^ * * probSy 


eqfal mofabmtief IT UluStratio “ of a sam P Iin !? Procedure with un¬ 
equal probabilities, let us reconsider stratified sampling Stratified 

tionttT? T f 0t , ed “ Chapter 4) inV0lves Partitioning a popula¬ 
tion mto strata of elementary units and selecting a simple random 
sample from each stratum. ple ran<lom 

1 no ThUS ’, SUpP0Se a ® rm ’ s management planned to select a samnlp nf 

°l e Tl 0yee f S t0 d ® termine the exten t of their health insuranceTov- 

or not 'T? t0 be US6d aS a basis for dec iding on whether 
0 «° ^ stltute a group health insurance plan 

The firm s employees first might be divided into three strata* A 
production workers, B_office workers, and C-others T hus aT 
sume that the sizes of the strata and the samples are as follo’ws: 


Stratum 

Number of 
Employees 

Sample 

Size 

A. 


60 

B. 

oUU 

qnn 

C. 

oUU 

1 no 

20 

Total.. . 

--..- 

LUU 

20 

hood 

100 
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Since a simple random sample is to be selected from each stratum, 
S probability of an employee being selected from a given stratum is 
equal to the ratio of the sample size to the stratum size. Thus, 


60 


= .100 


P(a Given Employee in Stratum A Being Selected) - g Q0 

20 

P(a Given Employee in Stratum B Being Selected) = ^ - .067 

20 

P( a Given Employee in Stratum C Being Selected) = ^ = - 200 • 

Note that even though the probabilities of an employee being se¬ 
lected vary from stratum to stratum, these probabilities are known 
Alternately, suppose that samples of 60, 30, and 10 were drawn 
from strata A, B, and C, respectively. What would be ^ prebabilrfy 
of an employee being selected from each stratum, m that case. 

The probability concepts and rules we have considered enable us 
to measure the chances of elementary units and combinations of ele¬ 
mentary units appearing in a sample. This is important m the theo¬ 
retical development of statistics. But probability theory does more 
—it provides us with a scale on which we can measure risks of rely- 
ing on sampling information. 

Example 5.12 A manufacturer receives shipments of a destructi¬ 
ble product in lots of 100. He uses small samples to decidehe 
not to accept the shipments. One statistical decision rule that he 

uses is: 

Select one item at random. If the item is nondefective, accept 
the lot. If the item is defective, select a second. If this is non- 
defective, accept the lot. If the second also is defective, reject the 

lot. 

Schematically, we may view this decision rule as follows: 



Since this decision rule is based on a sample, there are risks in 
applying it. These risks may be measured m terms of probabi ty. 
However, it is important to recognize that they’depend °” *'* 
state of the world is true. That is, if we measure the» qu ahty of a lot 
by tt, the risks of accepting or rejecting the lot depend on the true but 
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unknown value of 7 r. Because its value is unknown, we consider var¬ 
ious possible values of 7 r in order to evaluate the decision rule. 

For example, suppose a lot of 100 contains 1 defective—that is, 

7 T = .01. The above rule cannot lead to such a lot being rejected. To 
reject any lot we must observe 2 consecutive defectives—with only 
1 in a lot this is impossible. Hence, the probability of rejecting any 
lot with 7 r = .01 is zero—it cannot occur. 

Suppose we consider another possible state of the world, tt = .50— 
that is, 50 of 100 items in the lot are defective. What are the chances 
of selecting 2 defectives in a row from such a lot and, hence, of reject¬ 
ing it? Using the conditional probability rule we find 

P(Rejecting Lot with t — .50) = 

P(Defective, Defective) = x 

x \j\j yy 

= 2,450 

9,900 

= .248 . 

The probability of rejecting any lot with tt - .50 is only .248. The 
probability of accepting such a lot is 1 - .248 = .752. This indicates 
that we run the risk of accepting lots with as many as half of the 
items defective approximately three quarters of the time. 

Thus, the risks involved in relying on the suggested decision rule 
are very large. Nevertheless, such rules have been and are being used 
in industry. The application of some simple probability theory, how¬ 
ever, shows the large risks of such a rulh 

You might compute the probability of rejecting lots for other val¬ 
ues of 7 r besides .01 and .50. Obviously, there are many. But always 
remember these probabilities depend on which state of the world we 
assume to be true. 

In the preceding illustration the sample size was small. In con¬ 
trast, suppose our sampling process involved the selection of a ran¬ 
dom sample of 200 items from a lot of 5,000 units. The decision rule 
might then be: If 5 or more items are defective, accept the lot; 
otherwise reject the lot. We could calculate the risks of this decision 
rule as we did before, but the arithmetic would be arduous. It would 
involve the multiplication of 200 fractions. Fortunately, techniques 
have been developed to simplify such computations. This will be 
part of the subject matter of Chapter 6. 

Now, however, suppose that we turn to other important concepts 
of probability theory; concepts which will be useful to us as theo¬ 
retical tools in our study of statistical decision making. 

5.5 Independent Repeated Trials 

When repeated trials are independent rather than dependent, a 
special case of the conditional probability rule may be applied. This 
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special case is frequently encountered in statistics, but first may be 
illustrated by reference to a game of chance. For example, what is 
the probability of obtaining two heads in two tosses of an unbiased 
coin? Applying our conditional probability rule 

P(H and H) = P(H) X P(H\H) . 

That is, the probability of a sequence of two heads is equal to the 
probability of the first head multiplied by the probability of a second 
head given that the first toss is a head. Thus, 

P(H and H) = i X i 

= T • 

You will observe that since the coin is unbiased, the probability of a 
second head is not influenced by the occurrence of the first head. The 
probability is 1/2 in both cases. In other words, the conditional prob¬ 
ability P(H\H) is the same as the unconditional probability, P(H). 
This always is the case when the outcomes under consideration result 
from independent trials. 

We now can restate the conditional probability rule for the special 
case. When trials are independent, 

P(A and B) = P(A) X P(B) . 

The rule can be extended to more than two trials as follows: 

P(A AND B AND C AND . . . ) = P(A) X P(B) X P(C) X . . . . 

We are especially interested in the rule for independent trials when 
each trial can result in one of two mutually exclusive outcomes. For 
example, a coin can be a head or a tail; a television cabinet can be 
defective or nondefective; a person can be male or female; a fam¬ 
ily’s income can be less than $10,000 a year, or $10,000 or more a 
year. In our discussion of repeated independent trials we will con¬ 
cern ourselves only with such outcomes. 

The rule for independent trials is applicable in answering ques¬ 
tions such as the following: What is the probability that in a simple 
random sample of 100 entries from an accounts payable ledger there 
will be 2 incorrect entries and 98 correct entries? Suppose that we 
are concerned with the system of recording ledger entries. Hence, we 
are dealing with an analytical study. Our population is, therefore, 
infinite—consisting of the ledger entries past, present, and future 
made under similar conditions. The probability of an entry being se¬ 
lected as the second unit in a sample is not conditional upon the se¬ 
lection of the first entry. When 1 entry is chosen, there still remain 
an unlimited number of entries. 
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Suppose that we reconsider an earlier illustration to indicate the 
use of the rule for independent trials in the evaluation of the risks 
of relying on a decision rule. 

Example 5.13 A firm which has a policy of attempting to fill 
orders within a week’s time has reasons to suspect that its shipping 
process has slowed down. In the past it has been determined that 
only 5 per cent of its orders were delayed more than a week—and 
ns is regarded as satisfactory. Now, however, several customer 
complaints hint at the possibility that tt, the proportion of delayed 
orders, has increased from its normal and satisfactory level of .05. 

Note that this situation calls for an analytical study. It deals 
with a (shipping) process and, therefore, presents an infinite popula¬ 
Suppose that the firm plans to select a sample of 3 orders as a basis 
for deciding on whether or not to attempt to improve the shipping 
process. How will they use the information obtained from this sam- 
p v \ t ? on f ^ 3 shipments is delayed, the shipping process will 
not be checked; if 1 or more are delayed, the process will be investi¬ 
gated with an eye towards revising it. This is their statistical de¬ 
cision rule—but what is the risk of applying it? Actually there are 
many risks. 

One .is the risk of investigating the process because the sample 
shows 1 or more delayed orders, even though tt has remained at its 
normal and satisfactory level of .05. If we assume that the 3 orders 
selected for the sample are filled independently, this risk may be 
computed by the use of the rule for independent trials. Thus if 
tt ~ .05, for a single order selected at random, 

P(Delayed Order) = .05 
P(Undelayed Order) = .95 . 

Then the chance of not investigating the process is the chance of se¬ 
lecting 3 successive undelayed orders, namely, 

P (Undelayed and Undelayed and Undelayed) 

— P(Lndelayed) X P(Undelayed) X P(Undelayed) 

- .95 X .95 X .95 ' 

- .857 . 

It follows that the probability of investigating if ^ — .05 is 1 - .857 

- .143. This is the risk of checking a normal and satisfactory proc¬ 
ess. 

Another risk is that the sample will lead to not investigating the 
snipping process when tt actually has increased significantly For ex¬ 
ample, suppose tt = .25. Then, 

P(Delayed Order) = .25 
P (Undelayed, Order) = .75 
and the chance of not investigating the process is 

.75 X .75 X .75 = .422 , 

while the chance of investigating is 1 — .422 = .578. 
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Similar calculations indicate some of the other probabilities of m 
vestigating and of not investigating the process are as follows: 


7T 

P (Investigating) 

P(Not Investigating) 

oo . 

.000 

1.000 

05 . 

.143 

.857 

10 .. • 

.271 

.729 

15 . 

.386 

.614 

20 . 

.488 

.512 

.25. 

.578 

.422 


Check one or two of these values. Also, notice that these probabilities 
are all characteristics of the decision rule under consideration To 
evaluate that rule you should consider these chances or risks. Do they 
seem economical or not? Why or why not? 

The last illustration affords an opportunity for us to make note 
of an important thought. You can see that the probability of in¬ 
vestigating the process increases as * increases. Generally speaking, 
this means that the more the shipping process has deteriorated, t e 
better our chances of taking the right action—of checking the proc¬ 
ess. Thus, for it = .10, this probability is .271, while for ir = .25, it is 
.578. For less and less desirable states of the world the probability 
of taking the right action gets higher and higher, and the risk of 
making an incorrect decision gets lower and lower. This is true de¬ 
spite the fact that the sample size is quite small. What would a 
larger sample accomplish? It would mean smaller risks for the dif¬ 
ferent values of *•. 

5.6 The Binomial Formula 

In the proceeding section we used the rule for independent trials 
to evaluate the risks in relying on sample information from an in¬ 
finite population. While the method of calculation we used to com¬ 
pute these probabilities is relatively simple for small samples, a more 
generalized procedure would facilitate the determination of prob¬ 
abilities of outcomes for any size of sample. 

Such a generalized procedure is afforded by mathematical prob¬ 
ability. It takes the form of a formula called the binomial formula. 
This formula simply represents an extension of the rule for independ- 
ent trials. It also is related to the binomial theorem or binomial ex¬ 
pansion, a relatively simple mathematical relationship of importance _ 
in algebra. Perhaps you remember studying the binomial; if so, you 
will recognize the relationship between it and what we are studying 
here. If not, don’t worry about it—just remember that the formula 
we are studying is called the binomial. 
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Let us indicate the derivation of this formula. Suppose that we 
have n independent trials, such as n selections from an infinite popu¬ 
lation, and that each of the n trials can result in only one of two 
outcomes. To keep a concrete example in mind, envision a produc¬ 
tion process producing n parts each of which may be either defective 

dependent^ efeCtlVe ^ ^ aSSUme these parts are Produced in- 

What is the probability of a particular sequence? If we let 


P(One Defective Part) = w 
P(One Nondefective Part) == l — w } 

% quenclD^N^D i s ° r independent trials ’ the Probability of the se- 


or, in short, 


7T X (1 ~ 7r) X 7T , 


7T 2 (1 - 7r) . 

Similarly, the probability of the sequence N, D, D is 

(1 7 r) X 7 T X 7 T = 7 T 2 (1 — 7 r) , 

while that of the sequence D, D, N is 


71 A vr A L 


You should note that all three of these sequences refer to 1 non- 
defective and 2 defectives although they refer to their occurrence 
m different orders. It is no surprise, then, that the probability of 
each is - (1 -*). Now, it is important to recognize that the out- 
come D occurs twice m each sequence, and, hence, the exponent on 
it, the probability of D occurring, is 2. Since N occurs once the ex¬ 
ponent on (1 — tt), the probability of N occurring, is 1. 

Suppose, now, that we are not interested in the order of occur¬ 
rence and that we simply need to know the probability of 1 nonde¬ 
fective and 2 defectives in three trials. This is the probability of 
observing N D D or D N D or D D N and equals 


ir 2 (i - ir) + x^l - ir) + ,1-2(1 - T ) = 3 tt 2 (1 - j-) 

because they are mutually exclusive possibilities. 

Each of the terms, 3, iP, and (1 — ■*) has an interpretation. We 
have already noted that the exponents on ,r and (I — tt) represent 
the frequency of D and N in a sequence. A coefficient, such as the 3 
indicates the number of sequences in which, say, 1 nondefective and 
l detectives can occur. 
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Thus, if we were interested in the probability of 2 defectives and 
2 nondefectives, we know that we must have times some 

coefficient. What does this coefficient equal? There are six orders of 
two D *s and two N% namely, 

DDN N N DDN 

DNDN N D N D 

DN N D N N DD 

and hence the coefficient is 6. Thus, we may write 


p (2 Defectives and 2 Nondefectives) = 6tt 2 (1 - tt) 2 . 

The binomial is a generalized formula for computing the prob¬ 
ability of a given series of outcomes in any order. Thus, consider n 

trials in which 

r are defective, and 


n — r are nondefective. 


The probability of any such sequence is 

x r (1 _ T ) n-r 

since the exponent on *■ refers to the number of outcomes that are 
defective and the exponent on (1 — 0 refers to the number of out- 
comes that are nondefective. 

Furthermore, the probability of r defectives and n — r nondefec- 
tives in any order is *'(1 -»)*-' multiplied by the number of se¬ 
quences in which r defective and n - r nondefective can occur. This 
number of sequences may be expressed in terms of factorials as 

n \ ' 
r \ (n — r)! 

Factorials, as you should recall, were defined in Chapter 4 where a 
similar formula was developed for determining the number of pos¬ 
sible samples that could be drawn from a population. With n - 4 
and r — 2, we find 

4! _ 4 X 3 X 2 X 1 = g 
2! 2! 2 X 1 X 2 X 1 

a result which we also found earlier by direct enumeration. 

Thus, if we write p{A) for the probability of r outcomes of a spe¬ 
cified type (such as defectives) in n trials, 
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n = number of trials, 
r ~ number of specified outcomes, 
n — r — number of other outcomes, 

ir = probability of a specified outcome in a single trial. 

Th e binomial, as noted, is a general formula for computing prob 

c«mnu” n rrii epe , ntl r l tri * ls ' “ d “ SUOh is “ eful “ “*««■ 'or 

observations are““ ““ Ple 

pmg the process when n is only .02 fa aualifv IpvpI * j ^ 

satisfactory) 7 The binomial provides the answer. The probability C of 

wssas^assr- 1 * - «*»- ■ ^ 


n = 5 , r = 0 , *■ = .02, 

thus finding, for the probability of 0 defectives, 

p © = om ^ 02 )° (- 98 ) 6 

= (.98) 5 

/TO = - 904 • 

(Remember that 0 ! = 1 and (. 02 )° = 1 .) 

Similarly, for the probability of 1 defective, 


P © = ifi! ( - 02 )‘ (- 98 ) 4 


= 5 (.02) (.98) 4 
= .092 . 


fhese two results may be combined to show the probability of 0 or 
1 defective (which is the probability of not storing Se^cih 


.904 + .092 = .996 . 


Thus, the risk of stopping the 
.004. Does this mean that the 
one? Why or why not? 


process if 7 r = .02 is only 1 — .990 = 
production manager’s rule is a good 


n ortunately even the binomial formula involves more and 

Theref° mP - 1Ca cn d COmputations for ,ar g er and larger sample sizes 
see tm f °!-4' “ ^ cha P ter we wil1 return to the binomial and 

° f pt< "‘ bi,i<ie * mw be ■»«. 
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5 7 Expected Values 

In this chapter we have considered ways of measuring and com¬ 
puting probabilities of various outcomes m a random experiment or 
a series of random experiments. In this particular section we shall 
deal with an allied topic —expected value. This is the.average|va ue 
that would occur if the random experiment were to be repeated m- 

dC Srexample, suppose a die is thrown and the face value observed. 
The average value observed after many throws is what is meant by 
the expected value. What is this expected vriue? Assuming each 
side equiprobable, we know that each face value-1 2, 3 - 4 > a d 

6—is equally likely; that is, the probability of each is / 6 . By a 
sumption, then, in the long run each of these values would occur 
y 6 of the time. The average value is thus, 


l (1) + l (2) + J (3) + | ( 4) + l (5) + g (6) - 6 1 


3.5 , 


and the 3.5 is called the expected value of the random variable under 

consideration. . ,, 

Generally speaking, the expected value of any random variable 
may be computed by (a) multiplying each possible value the vari¬ 
able may assume by its probability of occurrence, and (b) summing 
these products. This is the method used m the preceding paragraph 
-each value (1, 2, 3, 4, 5, and 6) was multiplied by its probability 
of occurrence (Ye), and these results were added together. 

Example 5.15 Suppose that on the basis of his past experience, a 
newsdealer estimates the probabilities of selling 7, 8, 9, or 10 maga¬ 
zines per hour as follows: 

P( 7) = .2 
P(8) = .4 
P(9) = .3 
P(10) = .1 • 

The expected number of magazines is then 

7(,2) + 8(.4) + 9(.3) + 10(.l) = 8.3 . 

This is an average value-an average that might be expected in the 
long run. 

Example 5.16 A manufacturer receives shipments of a special part 
in lots of 50. He uses the following decision rule to decide whether or 
not to inspect the entire lot: 
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Select a simple random sample of 5 parts. If none of the 5 se¬ 
lected parts is defective, do wot inspect the remaining 45 parts, 
or more are defective, inspect the remaining 45 parts. 

The manufacturer desires to know the expected amount of inspec- 
rX M ° rder t0 ® Stlmate thc cost of inspection under this decision 

The total amount of inspection for any one lot depends upon the 

action taken. Note the tabulation that follows: upon me 


Action 

Number of Items Inspected 

Total Number 
of Parts 
Inspected 

Sample 

Remainder 
of Lot 

No further inspection. . . 
Full inspection. 

5 

5 

0 

45 

5 



50 


ftnm° r l low ® ver > i'i'e ' t °h a l number of parts inspected is a ran¬ 

dom variable since it results from a random experiment of testX a 
samp e of 5 parts. To find the expected amount of inspection H is 

“x e r:rxrtbr: the probabiiity ° f each 

P(No Further Inspection) = P( 0 Defectives in a Sample of 5) 

= 45 44 43 42 41 

50 X 49 X 48 X 47 X 46 
= .577 . 

tzxssr - io - “••—»** 

We can compute the expected amount of inspection: 

F(Amount of Inspection) = (5 X .577) + (50 X 423) 

= 24.0 . 

iS’stzt:!, 2 ’ -*■* 

tloXr eV6 !r r ’ y0U ™ Ust remember that the expected amount of inspec¬ 
tion depends on the probabilities of no inspection and of full inspec- 

wTT+X he " Ce ’ f ° n the P r °P° rtion ° f defectives in a lot We have 
found the answer for *• = .10, and similar calculations would enable 

?* “sr s, ;&%%£?' rB 

“fZ’Xn ft ’'** “>= « 

An expected value is a special type of arithmetic mean. That is 
is an average value. It is special only in the sense that it is the 
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average value of a random variable-the average of numbers which 

are the result of a chance experiment. 

Thus, it must be stressed that the term “expected value _ does not 
refer to what we expect to occur on any one trial-m any angle ran, 
dom experiment. An expected value is an average of what would 
happen only if such an experiment were repeated over and over 
again (an unlimited number of times). Thus, to re er ac o 
fact that the expected value of the outcome of throwing a die i 3 5 
we can see it is impossible to get a 3.5 on a single tossXertamlj^ this 
is not what we should “expect” on any one throw The 51.5^smiply 
indicates the average value that we would expect to o 

^Uncustomary in mathematics and statistics to use the symbol 
E to represent the term “expected.” Thus, if we let * refer to t e 
face value that occurs when a die is thrown £(*) refers to the 
“expected value” of this experiment and E(x) • 

The concept of expected value is useful in many connections m 
probability theory and in its fields of application. In statistics it is 
used primarily in connection with evaluating sampling procedures 
and hence, in determining sound statistical decision procedures. 

As has been stressed in this chapter, statistics such as the sample 
mean (5), the sample variance (s 2 ), and the sample proportion (p) 
are random variables. Since they are random variables we may ap¬ 
ply the idea of expected value to them. This point will be discussed 

more fully in the next chapter. . . , 

Note for the present, however, that by introducing the concept 
of expected values we have introduced a third application of t e 
average which is useful in statistics. Thus, n represents one use for 
the average—it represents the average value m a population. A sec¬ 
ond use for the average is to summarize information m a sample 
in this connection an average is denoted by x. Thirdly, any ex¬ 
pected value is an average-the long-run average value of a random 
variable. These uses for the concept of the average are all impor¬ 
tant, and they are all qualitatively different. Don’t confuse them. 

5.8 You Should Now Know That 

A statistic is a property of a sample. It summarizes some informa¬ 
tion contained in the sample. m 

Statistics are variables. They vary from sample to sample. ^ hen 
they are properties of probability samples, they are random variables. 
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»tt.X” h " h th ‘ y VillT ea " be PKdicW »” »“« of prob. 

Statistics of present interest and their symbolic names are: 

p. the sample proportion 
the sample arithmetic mean 
! : sam P Ie standard deviation 

5 . the sample variance 

«J meth„°2 ° f Pr ° b * bil “ ! ' “ * f ° Un “« «*»"' of .tafctal theory 
J .pmericl measure of probabi,i, y i, died . probability mea,- 

ouXXT "X Xf ! d “ ! h ° r P ° rti “ of .“” s “ 

en«”2 (S T “'“XXaitVXf pit tperi 

p»b, WIty b, it „ d B .r™ to v p x^„ii p ,!, u ’ u,e 

Repeated trials are dependent if thpv , 

are independent if they do not infWn fl 01 , le mother; they 

«l For two dependent trial, thi. ml, „, y ^ ,*p„,«ed „ 

P(A AND B) = P(A) X P(B\A), 

M S" 1 ,! css '” b ‘ biii ‘F-th, P ,„ b . bilily B 

b) For two independent trials this rule becomes 
P(A and B) = P(A) X P(B) 

triak 86 P( ' B ^ ~ P( ' B) lf A and B outcomes in independent 

£ 5 H=:i-SF-" 

5.9 You Should Now Be Able to Solve 

inchJ 1 TTn n r an f meter of a shi P lnent of 500 ball bearings is 248 
mchea. Under what orreumetanoe, would thi, average be ,S 
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parameter (,)? Tinder what eircumetaneee would it be a atatia- 

Mortality tablee tel, ua, Jo, ^ 

father not to purchase a tire insurance P UA y 
Should the student pass statistics? Why or why not? 

4. Explain why each of the following is either true or fa s . 

b) Srprotobflity'that a television cabto 

bl h :i.U ?S-* or’ . de, 1. 

c) Ss t“.SS “ “ 

d, It the*probability .1 m^^tSVSSS^ ft? 

ability that 2 successive 1 e ^ gome ru i erSj however, are 

e) A company manufactuies shorter. If the probability of a 

SKfiSSSTS'iX'i‘2, the Ptob-ity . * 

being shorter than 12 inches is .60. 

5. On the baaia of paat experience, it la est ™dljdyidt t ^ p ^ 
hility of a ^“eeksi Wh.t la 

w r;=o::o“;— 

“X;“ SS^.’S.'S 

Sr^cSuSL'do ^'thhih to* the deciaion rule off.ra 
aaainat accepting Ma of inferior qn.hty? 
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7. Suppose the error rate in a posting operation is 3 per cent. If a 
sample of 10 postings is observed, what is the probability of finding 
no errors? Two errors? Five errors? 

8. A sample of dwelling units is to be drawn by selecting a block at 
random and including every dwelling unit in the block in the sample. 
The blocks and the number of dwelling units in each are as follows: 


Block 

A. . 

B. . 

C. . 

D. . 

E. .. 


Number of 
Dwelling Units 

.10 

. 9 

. 12 

.14 

. 8 


What is the expected value of the sample size? How do you inter- 
pret this value? 


5.10 You Will Also Find That 

Some of the basic principles of the mathematical theory of prob¬ 
ability, remarks on its applicability, and notes on its historical de¬ 
velopment are discussed in the following references: 

Cramer, Harald. The Elements of Probability Theory, pp 11-28 New 
York: John Wiley & Sons, Inc., 1955. 

Polya, G. Mathematics and Plausible Reasoning, Vol. II: Patterns of 
Plausible Inference. Princeton, N.J.: Princeton University Press, 1954. 

Probability and its relationship to statistics are treated by: 

Wallis, W. Allen, and Roberts, Harry V. Statistics: A New Ap¬ 
proach, pp. 309-41. Glencoe, Ill.: The Free Press, 1956. 

Hirsch, Werner Z. Introduction to Modern Statistics, pp. 66-90 New 
York: Macmillan Co., 1957. 

D ^ C „ A , N ’ t , AcHESON J ' Qualit y Contro1 and Industrial Statistics, pp 
13-26. Rev. ed. Homewood, Ill.: Richard D. Irwin, Inc., 1959. 








CHAPTER ' 6 


Sampling Distributions 


6.1 Sampling Distributions Defined 

We now have discovered that statistics (measures computed from 
sample data) can be used in place of decision-parameters, when the 
latter are unknown. We have also seen that risks are connected 
with this procedure since the value of a statistic will vary depending 
on the particular sample selected. Whenever the sample which a 
statistic represents is a probability sample, this variation is random 
and is to some extent predictable. It is, so to speak, governed by the 
laws of chance and the theory of probability. Therefore, in the pre¬ 
ceding chapter, we acquainted ourselves with some basic probability 
theory and used this theory to evaluate risks of relying on sampling 
procedures. Our discussions, however, were limited to small-size 
samples and to situations where the observations made on the char¬ 
acteristic of interest were attributes (as opposed to variates). We 
shall now investigate the pattern of sampling variability as an ex- 
tension of our previous discussions. 

First, let us consider what statisticians mean by the expression 
“pattern of sampling variability.” It refers to the distribution of all 
possible values that some given statistic can assume, plus the respec¬ 
tive probabilities of those values occurring. Such a distribution is 
called a sampling distribution or, synonymously, a probability dis- 
tnbution 

We will use sampling distributions to summarize the probabilities 
of different values of a statistic occurring. Their use will enable us to 
calculate risks of relying on such statistics. As you will see, it is some¬ 
what easier to compute risks for large samples by the use of sampling 
distributions than by the simple methods discussed in the last chap- 
ter. 
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Now let’s look at the illustration that follows to discover some gen- 
eral characteristics of sampling distributions. 

Example 6.1 Consider a hypothetical small town in which all the 
cars have license plates numbered serially. That is, one car has the 
plate number 1, another has 2, still another has 3, etc 
. S «PP° se ) h at there were 6 cars in town. A statistician is to select a 
simple random sample of 2 cars. He defines a statistic, g, as the 
highest serial number in his sample. What values of g are possible 
and what are their probabilities of occurring? To answer this ques¬ 
tion suppose we list all possible samples, their probabilities of occur¬ 
rence, and the value of g associated with each sample. 


Sample 

Probability 

9 

1,2. 

i/ e 


1,3. 

A 5 
i/ E 

z 

1,4. 

1 5 

o 

1,5. 

/\ 5 

Mb 

ix - 

4 

1,6. 

O 

2,3. 

A 5 

IX- 

O 

2,4. 

A 5 

IX- 

6 

A 

2,5. 

A 5 

IX- 

4 

2,6. 

A 5 

IX- 

o 

a 

3,4. 

A 5 

IX- 

O 

A 

3,5. 

A 5 

1/ _ 

4 

c 

3,6. 

A 5 

Uk 

o 

a 

4,5. 

/l 5 

IX- 

O 

K 

4,6. 

A 5 

IX- 

o 

a 

5,6. 

A 5 
\X- 

o 

a 

------- 

Ah 

O 


we can rearrange this tabulation 
and their probabilities of occurrence. 


9 

2 . 

3. 

4. 

5. 

6 . 


Probability 

••Mb 
• • • /b 5 

.-•^5 


This table represents a probability distribution or sampling distribu- 
tion of the statistic g i/ there are 6 cars in town. It tells us, for exam- 
pie, that if there were 6 cars in town and we were to take a large 
number of samples, 4 out of 15 times the largest number in our sam- 
p e would be a 5, 1 out of 15 times the largest number would be a 2 

otr* f 


i 


Notice that in this example the sampling distribution referred to a 
specific statistic (g), to a specific method of sampling (simple ran- 
om), to a specific sample size (n ~ 2), and to specific conditions in 
the population (6 cars in town). If we had varied any of these con¬ 
ditions, say, by considering a sample of 5 or by assuming the popu- 
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lation contained 100 cars, we should get a different sampling dis¬ 
tribution Therefore, you should remember that whenever we speak 

ZXling distribution „s must speak of it wild, refasne. to: 

1 . A given statistic, 

2. A method of sampling, 

3 . A sample size, and 

4. A specific population. 

The method of determining a sampling distribution that we have 
used in the last example is completely valid. However it is not very 
effective from a practical point of view. For example, if the popula¬ 
tion is infinite, it is, of course, impossible to write down all pore* 
samples of a given size-there are an infinite number. Even if the 
population contains a finite number of observations enumeration 
usually is impractical. Thus, there are 75,287,520 possible samples of 
size 5 that might be drawn from a population of 100. You cou d 
spend your entire college career just computing all possible values of 
a given statistic. Therefore, in this chapter we must develop more 
sophisticated methods of determining the sampling distribution 

of a statistic. 

6.2 The Sampling Distribution of p 

The sample proportion (p) is one of the statistics used frequently 
in statistical decision problems. Let’s consider its sampling distribu¬ 
tion This distribution presents all possible values of the sample pro¬ 
portion and the probabilities of their occurrence 

In the preceding chapter we developed an analytical tool t e 
binomial formula-to calculate the probability of r occurrences of a 
given attribute in a sample of n items. However, p, the sample pro¬ 
portion, is by definition r/n. For example, if m a sample of 5 items, 
2 are defective, p = .40. Hence, the probability of 2 defectives m a 
sample of 5 items is the same as the probability of p — .40. i ere- 
fore we may use the binomial formula to determine the probability 
of all possible values of p. Remember that the sampling distribution 
of p is expressed exactly by the binomial if and only if we are con¬ 
cerned with simple random samples from an infinite population o 
attributes. Furthermore, the distributions we derive must always 
refer to a fixed sample size and specified conditions about the popu¬ 
lation (a fixed *•). 

Let’s restate the binomial formula: 


p(z = p) = 


n\ 




*-(i - • 
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In applying it for a given n and tt we let r equal 0, 1, ... , n in or¬ 
der to get a set of n + 1 probabilities, each of which is associated 
with a certain r or p. This set of probabilities is called the sampling 
distribution of p. However, for different values of n or tt we obtain 
different probability distributions. 

Our main task in this section is, therefore, to consider the effect 
of different values of n and *r upon the sampling distribution of p. 
To start, let’s fix n at 5 and see what the probabilities of all possible 
values of p are when tt = .05, .25, .50, .75, and .95. By successive ap¬ 
plication of our binomial formula for r — 0, 1, 2, 3, 4, and 5, for 
n = 5, and for the appropriate values of tt, we find the following five 
sampling distributions: 


p 

TT = .05 

TT = .25 

TT — .50 

7T = .75 

, ..’Jl_* 

TT = .95 

.00. 

.774 

.237 

.031 

.001 

.000 

.20. 

.204 

.395 

.156 

.015 

.000 

.40. 

.021 

.264 

.313 

.088 

.001 

.60. 

.001 

.088 

.313 

.264 

.021 

.80. 

.000 

.015 

.156 

.395 

.204 

1.00. 

.000 

.001 

.031 

.237 

.774 


The five probability distributions of p, tabled above, are shown in 
Figure 6.1. Look at them for a minute or two so that we can discuss 
them with a clear picture in mind. First of all, notice that the sam¬ 
pling distribution of p for tt = .50 is symmetrical In short, the left 
side is a mirror image of the right, and vice versa. However, the 
other four distributions are skewed. The distributions for tt = .05 
and tt = .25 are skewed to the right. Alternately, the distributions 
for tt = .75 and tt = .95 are skewed to the left. In addition, we no¬ 
tice that those distributions for tt — .05 and tt = .95 are more skewed 
than those for tt = .25 and tt — .75. These observations permit the 
following generalizations: 

1. The sampling distribution of p for ,r = .50 is always symmetrical no 
matter what the sample size. 

2 . The sampling distribution of p is skewed to the right if tt is less than 
.50. It is skewed to the left if tt is more than .50. 

3. The further tt departs from .50,, the greater the skewness pf the 
sampling distribution of p, for a given sample size. 

In addition, notice that the distributions of v = .25 and tt = .75 are 
mirror images of each other. Similarly, the distributions for tt '= .05 
and tt == .95 are mirror images. Why? 

It is important to observe that the most probable values of p 
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always are in the neighborhood of the value of tt. Thus, if * = .05, 
low values of p are the most likely and high values of p are the least 
likely. On the other hand, if *r = .95, high values of p are the more 
likely—low values of p may occur, but their chances are small. 

Figure 6.1 


PROBABILITY 

PROBABILITY 

.800 

- 

.800 

- 

.700 

- 

.700 

TT= .05 

- 

.600 

- 

.600 

— 

.500 

- 

.500 

- 

.400 

- 

.400 

- 

.300 

- 

.300 

- 

.200 

- 

i - 200 

- 

.100 

- 

I .100 

_1_I----- n 

— 


Tr= .25 


.00 


.20 .40 .60 

VALUES OF P 


.80 1.00 


.00 


.20 .40 .60 

VALUES OF P 


.80 1.00 


PROBABILITY 


PROBABILITY 

.800 
.700 
.600 
.500 
.400 
.300 
.200 
.100 
0 



Tl = .75 


.00 


.20 .40 .60 

VALUES OF P 


.80 1.00 



Now let’s consider what happens to the sampling distribution of 
p as we increase the sample size. In Figure 6.2 sampling distribu¬ 
tions of p are portrayed for sample sizes 5, 30, and 150 when tt = .25. 
The probabilities needed to construct these charts were derived by 
using the binomial formula repeatedly. One thing you notice im- 



PROBABILITY 
.40 n 



PROBABILITY 
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mediately is that there are more vertical lines on the graphs as n 
increases —p can attain a wider variety of values when n is large. 
More important, the concentration about the value of v gets more 
and more pronounced as the sample size increases. That is, the prob¬ 
ability that p will be approximately equal to * gets greater and 
greater. 

Lastly, we should observe that as the sample size increases, the 
sampling distribution of p becomes more nearly symmetrical and 
more nearly bell-shaped. In fact, as n gets larger, the distribution of 
p tends to resemble that of a normal curve more and more. This ob¬ 
servation can be proved mathematically. That is, for sufficiently 
large simple random samples, it may be shown that the sampling 
distribution of p for any given value of ^ may be well approximated 
by a normal curve. 

If the sample size is large enough, therefore, we can calculate 
probabilities of values of p by means of the normal curve rather 
than by the binomial formula. But, what do we mean by “large 
enough”? We know that for small samples, the binomial distribu¬ 
tion is symmetrical when ^ is .50, but it becomes more and more 
skewed as ?r departs from .50 in either direction. This indicates that 
the more v departs from .50, the greater the sample size needed for 
the normal curve to offer a satisfactory approximation of the bi¬ 
nomial distribution. The table that follows presents general rules 
suggested by one statistician to decide whether n is “sufficiently 
large.” It indicates the smallest value of n for which the normal ap¬ 
proximation should be used if ^ is a certain value. 

Use the Normal 

Approximation 


jf T Only if n Is at Least 

Equals Equal to: 

.5. 30 

.4 or .6. 30 

.3 or .7. 80 

.2 or .8. 200 

.1 or .9. 300 

.05 or .95.1,400 


Reprinted with permission from William G. Cochran, Sam¬ 
pling Techniques (New York: John Wiley & Son, Inc., 1953), p. 41. 

6.3 Characteristics of the Sampling Distribution of p 

In the last section we developed a second important use of the 
normal curve—as an approximation of the sampling distribution of 
p. Heretofore, in Chapters 2 and 3, we considered the normal curve 
as a population model. At that time, we saw that areas under the 
normal curve—in other words, the relative frequency with which 
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observations were contained in a given interval—could be deter¬ 
mined if we knew the population mean and standard deviation. Now 
we similarly could determine the relative frequency with which sam¬ 
ple proportions fall within a given interval if we knew the mean and 
standard deviation of the sampling distribution of p. For example, 
we would be able to say that approximately 68 per cent of all pos¬ 
sible sample proportions would be included in an interval of plus and 
minus one standard deviation about the mean of the sampling dis¬ 
tribution. The 68 per cent, in this case, represents the relative fre¬ 
quency of occurrence of outcomes of a random variable—the sample 
proportion. Therefore, in keeping with the relative frequency defi¬ 
nition of probability, the .68 may be also interpreted as a probability 
Thus, areas, relative frequencies, and probabilities are equivalent 
concepts when the normal curve is used to describe a sampling dis- 
tribution. 

But to measure these probabilities we must first know the mean 
and standard deviation of the sampling distribution of p. Let us first 
consider the mean. We already have mentioned this type of arith¬ 
metic mean in another connection. In Chapter 5 we pointed out 
that the expected value of any random variable was the average or 
the arithmetic mean of the values that would occur in the long 
run. At present we are interested in a particular random variable— 
the sample proportion, p. Therefore, we are interested in the ex¬ 
pected value of p—the arithmetic mean of the sampling distribution 
of p. Symbolically this is represented by E(p). 

Example 6.2 Consider the sampling distribution of p for n = 4 

and = .30. The results were derived through the use of the binomial 
formula. 


Probability 
. . .2401 
. .4116 
. .2646 
. .0756 
. .0081 

To calculate the expected value of p we must multiply each possi¬ 
ble value of p by its probability of occurrence, and then add the 
products. 


V 

.00 

.25 

.50 

.75 

1.00 


E(p) ~ (.00) (.2401) + (.25) (.4116) + (.50) (.2646) 

+ (.75) (.0756) + (1.00) (.0081) 

= .0000 + .1029 + .1323 + .0567 + .0081 
= .30 

This value, .30, represents the average of the values of p that would 
occur m the long run—it is the “expected value” of p if a simple ran- 
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dom sample of size 4 were selected from a population in which 
7r — .30. 

Now, note that in our example, E{p) = .30 and that *■= .30. 
This is no coincidence—it is a general fact which can be proved 
mathematically. That is, whenever we use simple random sampling, 

E(p) = K • 

What does this imply? It tells us that in the long run the average 
sample proportion will be equal to the population proportion no 
matter what the value of this population proportion. 

Of course, this long-run business may not mean too much to us. 
After all, in practical applications, we ordinarily will select only one 
sample. Therefore, in addition to the expected value of p, we would 
like to find out something about the variability of p. Would it vary a 
great deal from sample to sample if we selected a series of samples. 
By how much, on the average, will any one value of p depart from 

the expected value of p? . 

To answer these questions we may turn to the standard deviation 
of p or, more fully, the standard deviation of the sampling distribu¬ 
tion of p> In short, we are going to consider a single measure of the 
variability of p. We will use the symbol <?„ for this measure. In ad¬ 
dition to being called the standard deviation of p, this measure 
usually is called the standard error of p. We will use the terms inter¬ 
changeably. Since the standard error of p measures the variability 
of all possible sample proportions about the population proportion 
(the mean of the sampling distribution of p), it is a measure of the 
reliability of sample results. Obviously, the smaller the standard 
error of a proportion, the closer the possible sample proportions will 
be to the population proportion and, hence, the more reliable will 
be the sample results. 

For any given sampling distribution, <? p or <r/ (the variance ot p) 
can be computed directly from the sampling distribution.. The var¬ 
iance of p represents the average of the squared deviations of 
all possible sample proportions about the population proportion. 
However, since p is a random variable, the squared deviation of p 
about n also is a random variable. But, we have called the average 
value of a random variable, the expected value. Therefore, we can 
say that <r* is the expected value of the squared deviations of p 
about *■ or 

*»* = E(p - *)* 
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Thus to obtain the variance of a random variable such as p, we 
must multiply each squared deviation by its probability of occur¬ 
rence, and then add the products. Let’s now apply this formula to 
the sampling distribution of p in Example 6.2 where n was 4 and v 
was .30: 


V 

Pip) 

(P “ *) 

.00 

.2401 

-.30 

.25 

.4116 

-.05 

.50 

.2646 

+.20 

.75 

.0756 

+.45 

1.00 

.0081 

+.70 




(p - Try 

2[(P - it) 2 X P(p)I 

.0900 

.021609 

.0025 

.001029 

.©400 

.010584 

.2025 

.015309 

.4900 

.003969 


Total.052500 


o-p 2 = .0525 
<t v — .23 . 

This is a tedious method of computing v P . However, the difficulty 
is eliminated by a simple formula which can be derived mathemati¬ 
cally. It states 




a - 7r) 
n 


Substituting the values of * and n from the preceding computation, 
we obtain the same result 

0X1-.30) 


-4 

= yj .0525 


.21 

4 


= .23 . 

This value of a v is a measure of the variability in a sampling pro¬ 
cedure. It describes the variability among values of p (for samples 
of size n) that would result with repeated simple random sampling 
from an infinite population in which it equals .30. 

Let us now consider some important aspects of <r p . You will notice 
from the formula for <r v that it depends on n, the sample size, and », 

the population proportion. Notice that it varies inversely with n _ 

that is, for larger and larger values of n, is smaller and smaller. 
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This expresses an obvious but very important fact. Ij we increase 
the sample size, the distribution of sample proportions will become 
less and less variable. In other words, the reliability of sample re¬ 
sults increases as the sample size increases. 

For example, consider the following values of o•„ for varying sam¬ 
ple sizes when the population proportion, tt, equals .5: 

n 

' 25.1000 

100.0500 

500.0224 

1,000.0158 

10,000.0050 

Thus, as n gets larger, <j p gets smaller. 

Now notice that the standard error of p varies directly with the 
value of 7 r(1 — 7 r). Below some values of tt(1 — tt) are tabulated 


for various values of *■: 


7r 

7r(l — 7r) 

\ no . 

.00 

an 

.09 

7 n .21 

50 . 

.25 


.21 

. . . . 

10 . . . . 

.09 

.00. 

.00 


You see that tt(1 — ir) is at a maximum when x = .50. As v departs 
from .50 in either direction, tt(1 — tt) gets smaller. 

The population proportion, tt, is related to the amount of varia¬ 
bility in such a population. For example, if all elementary units 
have, or do not have, a given attribute, tt equals one or zero, respec¬ 
tively. In either case all observations are the same and there is no 
variability. However, the more variable a population, the closer the 
population proportion will be to .50. The most variable population 
of attributes is one for which tt = .50. Thus, the more variable the 
population, the larger will be the value of *-(1 — *■). 

In summary, the larger the sample size or the less variable the 
population, the smaller <r p will be. The smaller cr p) the greater is the 
reliability in making decisions on the basis of sample proportions. 

Now that we have determined the mean and the standard devia¬ 
tion of the sampling distribution of p, we can use this information 
to measure sampling risks when the normal curve approximates the 
binomial distribution as the sampling distribution of p. 
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6.4 The Normal Curve and Sampling- Risks 

We now know that for simple random samples from an infinite 

population, the normal curve may be used to describe the sampling 

stubution of p when the sample size is sufficiently large In Sddi- 

tion, we know how to determine the values of E(p) and o- the per 

tment characteristics of the sampling distribution of p leeLTl 

determme areas under the normal curve. A table in Appendix B to 

which we have referred before, gives areas under the normal curve 

, ^ US€ thls table often t0 measure probabilities and hence 

should become proficient in its use. ’ 

onJtai/ 1 ofth! ApPen , diX B tel ! s us the area that is contained in 
one tail of the normal curve when we go out a given number of 

standard deviations from the mean. Thus, consider the normal 
urve as an approximation of the sampling distribution of p With 
reference to the diagram below, the table of areas under the^ormal 
curve pees usthe urea in the tail (shaded portion, when w “ 
standard errors of p from E(p), or r: * 



II 1 !’ S ref ° re ’ t0 Specify * in order t0 use the table of 

eas. This quantity, z, is called the normal deviate. It represents the 
distance ton, the mean of the ncmal curve to a p.rtiXfd™'of 

st “ <l " d devi “ 10 "' Sj "” Wi “"r ^ »*y be 


_ V - 7T 


The distance between p and the mean of the distribution is (p - ») 
no “s ft h ° W m “ y “«" d ” d d ™«»» 
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Examvle 6 3 What is the probability that a sample proportion 
wiD exwed 52 if it is computed from a simple -dorr^ sample of^ 
100 drawn from an infinite population for which tt - .50 1 the area m 
nuestion^is represented by the shaded portion in the dragram below. 



To use the table of areas, we must determine z: 

_ P ~~ T . 

O p 


We first compute o p \ 


Op 


4 


7r(l — t) 

n 


.50 X .50 


= .05 


100 


Then we substitute in the expression for z> 


z = 


.52 - .50 

M 


= = .40 . 

.05 

The table of areas states that the area in question is 

the probability of getting a sample proportion equal to .52 or more 

■ 34 lt t!mrwe 5 shan d be interested in the area in the central portion of 
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.50 .52 
VALUE OFp 

We have found the area in the tail to be .3446. But, the area in half 
the normal curve, because of its symmetry, is .5000. Hence, the area 
m the central portion is 

.5000 - .3446 = .1554 , 

and this represents the probability that p would fall between 50 and 
.52 if 7 r = .50. 

You might also verify that the probability that p would be less 
than .52 is 

.5000 + .1554 = .6554, 
or 

1 - .3446 = .6554 . 

Why? 

Our main interest in probability is as a measure of risk in relying 
on samples. Let's look at some illustrations of how the normal curve 
can be of use in this connection. 

Example 6.4 A corporation has been contemplating the installa¬ 
tion of an electronic computer to save time on paper work. The con¬ 
troller opposes the installation of the new equipment on the ground 
that paper work accounts for only 30 per cent of employees’ work 
time. His assertion is challenged, and a study is undertaken to de¬ 
cide whether or not to install the equipment. 

A statistical decision procedure is formulated. A simple random 
sample of 100 different time periods is to be selected. The worker to 
be observed at each time period is chosen at random—the same 
worker may be selected for observation more than once. At each time 
period a worker is reported to be doing or not to be doing paper work. 
The sample proportion p is the proportion of time periods workers are 
engaged in paper work. It is agreed that if p is found to be .40 or 
larger, the corporation will install electronic equipment; if p is less 
than .40, the equipment will not be installed. 
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What is the risk that the company will install the new equipment 
even though the controller’s assertion that tt = .30 is correct? The 
problem stated statistically is to determine the probability of obtain¬ 
ing a value of p equal to or greater than .40 if t r = .30 and n - 100. 

We can use the normal curve as an approximation of the sampling 
distribution of p since a sample size of 100 is sufficiently large for 
tt = .30. The probability is measured by the shaded area in the dia¬ 
gram below: 



we note that 




I 


?r(l ~ *) 

n 


(.30) (.70) 

100 


= .046 


and, hence, 

.40 - .30 _ .10_ 

0 “ .046 .046 

= 2.17 . 

When z = 2 17, the area in the tail of the normal curve is .0150. In 
other words, the probability that p will equal or exceed .40 and, 
hence, that the company will install the equipment is .015 if w - .30. 
Thus, 15 times in a thousand, the corporation would be m error m 
using a decision rule such as the one suggested, if tt was .30. Do you 
think this risk is too great? 


Example 6.5 To illustrate further the use of the normal curve to 
measure risks, we continue with the same problem situation and sam¬ 
ple size given in the preceding example.. Suppose that tt is .45. 
What is the probability that the corporation will not install elec¬ 
tronic equipment under the same decision rule? Remember that the 
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callv^P srT'nt 104 b ® insta ! 1 ® d if P is less than .40. Stated statisti¬ 
cally, we are interested m determining the probability of obtaining 

ample proportions less than .40 from simple random samples of 100 
low - A5 ‘ The arCa m questlon is the shaded area in th e diagram be- 



VALUE OF p 

To find z, we note that 


and that 


Hence, 




(T « 




V 


7r(l ~ t ) 
n 


(•45) (.55) 
100 


.050. 


z — 


~ .05 
.050 

= - 1.00 . 

Don t let the minus sign upset you. It merely indicates that p is 
less than Since the normal curve is symmetrical, we can ignore^the 

bmty“o be nS 1587 ^ 6 ° f areaS fOT 2 = 1 -°°- We find ^ P r °ba- 

t . Tbus ’ “ foUowi "S its decision rule, about 16 times in a hundred 
the company would commit the error of not installing electronic 

work time Whe ” aCtUalIy lts paper work ara °unted to 45 § per cent of 

We have used the normal distribution to measure sampling risks 
in problems where the sample proportion was the criterion for deci¬ 
sion. These risks could have been computed by use of the binomial 
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formula. However, the use of the normal curve greatly simplified the 
calculations necessary for the measurement of these risks. 

6.5 Finite Twofold Populations 

We have seen that the binomial gives us an exact distribution of 
the sampling distribution of p only when the proportions are com¬ 
puted from simple random samples selected from infinite popula¬ 
tions. In addition, we know that as the size of such samples from in¬ 
finite populations increases, the sampling distribution of p is approxi¬ 
mated by the normal curve. 

Although infinite populations of attributes occur frequently in sta¬ 
tistical practice, so do finite populations. The characteristics of a lot 
of manufactured product, if each item’s quality is measured as de- 
fective or nondefective, serves as one example. The population of 
homeowners versus renters in a town involves another. And m fact, 
any time we conduct an enumerative study we have another exam¬ 
ple of a finite population. In such instances the assumptions used to 
develop the binomial do not apply since the probability of one item 
being selected for a sample is not independent of, but conditional 
upon, the items that were selected before it. 

The basic question is, then, how can we determine the sampling 
distribution of p when we select a simple random sample from a 
finite population? Is there a formula which will give us these prob¬ 
abilities just as the binomial serves us for infinite populations? The 
answer is “yes.” A formula can be derived to determine the exact 
sampling distribution of p from a finite population. 1 Such a formula 
could be developed by the application of the conditional probability 
rule in the same way that the binomial formula was derived by the 
application of the rule for repeated independent trials. However, to 
develop this formula here would carry us into the small print of the 
study of statistics, and beyond the range of the present treatment. 

Some characteristics of the distribution of p from a finite popula¬ 
tion, however, are of interest to us. In the first place, when the sam¬ 
ple size is small relative to the size of the population, the sampling 
distribution for p is approximated closely by the binomial distribu¬ 
tion. This is not surprising. If we were to draw a sample of 100 em¬ 
ployees from a factory employing 10,000, the probability of select¬ 
ing the first member of the sample would be tio.ootC the second 
member given the selection of the first, %,999 ; the third given the 

iThe formula is known as the hypergeometric, and its application to all possible 
values of p gives rise to the hypergeometric probability distribution. 
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selection of the first two, 1 / 9 , 9 os! and so on. Thus, although the 
probabilities are not the same, the differences are very small. 
Therefore, the use of the binomial distribution which assumes equal 
probabilities will be only slightly in error. As a general rule, if the 
sample size is 10 per cent or less of the population size, the bino¬ 
mial distribution gives a satisfactory approximation of the sampling 
distribution of p from a finite population. 

Example 6.6 A simple random sample of 4 invoices is taken from 
a file containing 100 invoices. Note that this sample size is only 4 per 
cent of the population size. If 7 r = .20 is the proportion of invoices in 
error in the population of 100 , what is the sampling distribution of 
p{p representing the proportion of incorrect invoices in the sample) ? 
The exact probabilities may be derived by the application of the con¬ 
ditional probability rule. Thus, for example, the probability of ob¬ 
taining p 1 .00 i.e., 4 out of 4 invoices with errors—is 


P{p = 1.00) = ^9. x — x - 1 - x — - # nni 9 
; 100 X 09 X 98 X 97 “ 0012 ‘ 


In contrast, the binomial probabilities may be obtained by repeated 
application of the binomial formula. 


p 

Exact 

Probability 

Binomial 

Probability 

.00. 

.4033 

.4096 

.25. 

.4191 

.4096 

.50. 

.1531 

.1536 

.75. 

.0233 

.0256 

1.00. 

.0012 

.0016 


Thus, the binomial provides a relatively good approximation of the 
exact probabilities in this case. 


A second important characteristic of the sampling distribution of 
p from a finite population is that as the sample size increases, this 
distribution, like the binomial, may be approximated by the normal 
curve As you know, the mean and standard deviation of p are 
needed to approximate probabilities by means of the normal curve. 
The expected value of p from a finite population is ■*, which is the 
same as the expected value of p from an infinite population. How¬ 
ever, the formula for the standard deviation of p from a finite pop¬ 
ulation differs slightly from that for p from an infinite population, 
thus, for a finite population 





N -n 

n - r 


€T, 
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The only difference is in the factor \/N — n/N — 1. Because this 
factor is attached to the formula for <r p when the population is 
finite, it is called the finite population correction, or simply the 
f.p.c. 

This “f.p.c.” must be less than one. This means, when tt and n are 
constant, <r p will be smaller for finite populations than for infinite 
populations. This is reasonable. Certainly if we sample from a 
finite population, the amount of variability in the sample statistic 
should be somewhat less than if we sample from an infinite popula¬ 
tion. 

Furthermore, if the sample size n is relatively small compared 
to the population size N —i.e., n is 10 per cent of N or less the 
f.p.c. is very close to 1, and need not be used to compute <j p . This 
is readily understandable for when n is small relative to N, the 
sampling distribution of p from a finite population is approximated 
by the sampling distribution of p from an infinite population. 

Example 6.7 A life insurance company packs its rejected applica¬ 
tions in batches of 500. Before the applications are stored, a random 
sample of 100 applications is selected to determine whether the ap¬ 
plications should be stored away or whether the remaining applica¬ 
tions should be examined further to see that no requests for insurance 
have been denied in error. The company uses the following rule: If 3 
per cent or fewer of the sample of 100 indicate errors were made, the 
applications are stored immediately; however, if more than 3 per 
cent indicate errors were made, the remaining 400 are checked fur¬ 
ther. . , . 

How much protection would this decision rule give the company 
against neglecting to re-examine a batch of applications that had 5 
per cent of the applications incorrectly rejected? Stated statistically, 
the problem is to determine the probability of finding 3 per cent or 
fewer applications defective in a random sample of 100 when tt = .05. 
The probability desired is measured by the shaded area in the dia¬ 
gram below: 



.03 
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To determine this area under the normal curve, we must find 


Thus, 


z 


P 7T 


\ n 

= frnrn 

\ too 

= (.022) (.895) 

= .020 . 


V 

>/ 


iV - n 
W - 1 


500 - 100 
500 - 1 


Substituting in the equation for z, 


.03 - .05 
25 .020 
= - 1 . 00 . 

When z = 1.00, the area in the tail is .1587. Therefore, the probabil¬ 
ity of neglecting to examine applications when 5 per cent are in error 
is approximately .16. 


Example 6.8 A political party leader, before the party’s nominat¬ 
ing convention, conducts a survey to determine the popularity of his 
favorite candidate for the gubernatorial nomination. He selects a 
simple random sample of 400 from the half-million registered voters 
of his party. What is the probability that the results of the poll will 
be in error by 2 percentage points or more if 60 per cent of the regis¬ 
tered voters actually do favor the candidate? An error of 2 percent¬ 
age points can be made in two ways. The sample p can either be 2 
percentage points less than or 2 percentage points more than This 
accounts for the two shaded areas in the following diagram: 



To compute 
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we first find 



r(l — 7r) 

n 


(•60) (.40) 
400 


- .0245 . 


Notice that in this case we did not use the f.p.c. even though the pop¬ 
ulation is finite. This is because the f.p.c. is close to 1, a fact you may 
verify. 

When p = .62, 


.62 - .60 
z ~ .0245 

= .82. 

When z — .82, the area in the tail is .2061. Therefore, the probabil¬ 
ity of getting sample proportions of .62 or greater is .2061. However, 
the probability of getting sample proportions of .58 or less also is 
.2061, because of the symmetry of the normal curve. Thus, the prob¬ 
ability that an error of 2 percentage points or greater will be made in 
the survey, based on a simple random sample of 400 if x = .60, is 

.2061 + .2061 = .4122 . 

On the basis of this result, do you consider the sampling procedure 
sound? 

The fact that the f.p.c. is almost equal to 1 when n is small rela¬ 
tive to N points up an important thought. Let us investigate the 
relationship of <r p to N, by considering a group of 4 populations 
which vary in size but have the same population proportion—rr = 
.50. The pertinent characteristics, n and N may be summarized as 
follows: 


Population 

7r 

N 

A . 

.50 

200 

B . 

.50 

1,000 

10,000 

C . 

.50 

D . 

.50 

Infinite 




Suppose we were to select a simple random sample of 16 observa¬ 
tions. The values of <r p for each of the 4 populations would be as 
follows: 

A B C H 

0* .1150 .1231 .1248 .1250 
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This is indeed an amazing result. All values of are practically 
equal even though the population sizes vary from 200 to infinity 
Thus, for a given the variability of a sampling procedure and' 
hence, the risk of relying on such samples will be about the same 

even though one population is relatively small and the other rela- 
tively large. 


6.6 The Sampling Distribution of the Mean 

In the previous sections we were concerned with the sampling 
distribution of the sample proportion. The sample proportion is 
important when the population of interest is one of attributes and 
the decision-parameter is We turn now to a consideration of the 
sampling distribution of the sample mean (x). The sample mean is 
an important statistic when the population of interest is one of 
variates and the decision-parameter, /«. is unknown. Thus, the ma¬ 
terial for x we will consider in this chapter is somewhat analogous 
to the material for p we covered earlier. However, at that time we 
were able to derive an exact expression for the sampling distribu¬ 
tion of p with simple random sampling from an infinite popula¬ 
tion. This was called the binomial formula. In connection with x, 
owever, we shall not be able to find anything quite comparable 
to the binomial. However, we shall develop a normal approxima¬ 
tion to the sampling distribution of x. Fortunately, it is a very 
good approximation for almost all problems with which one is 
faced in practice. 

To gam an appreciation of the reasons why this approximation 
is a good one, we shall begin with another simplified example 
Throughout the discussion of this illustration we shall assume sim¬ 
ple random sampling. Furthermore, the immediate example repre¬ 
sents an enumerate study, and it therefore deals with a finite 
population. This means that for the time being we are going to deal 
with the sampling distribution of 5 under the conditions of simple 
random sampling from a finite population. Later we shall extend 
the results to simple random sampling from an infinite population. 
Now, for the basic facts of the illustration. 


Example 6.9 A firm desires to determine the average or mean age 
of its existing accounts receivable on the basis of a sample survey. 

. f.J esu , sam Ph are to enable the firm to decide whether 

additional measures are to be taken in an attempt to speed up pav- 
ment of accounts. The firm decides that if they knew P was greater 
than 45 days, they would take additional measures; otherwise they 
would take no action. J 
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Suppose that the set of ages (in days) of the 10 accounts receiv¬ 
able (the small population for convenience) are, as of December 31, 
as follows: 


Name of Account 

Age (in Days) 

A • • • • 

.42 

B . 

.28 

c . 

.61 

n _ 

.30 

n . 5 

F . 

.15 

a 

.48 

H . 

.25 

I . 

.36 

J. 

.40 


Of course, if the firm knew this population, they would compute 
p = 3 3 o / 10 - 33 , and take no action. However, we are interested m 
what they might do if they didn’t know this population and made 
their decision on the basis of sample information. 

We will begin our analysis of the sampling distribution of x with 
reference to this example. Suppose, to start, we consider all possible 
samples of size 2 that can be drawn from the population of 10 ages. 
There are 45 possibilities. Of course, only one would be observed 
in a practical situation, but to learn about the sampling distribu¬ 
tion of x here we are studying all of them. 

Now, if our selection procedure is simple random sampling,, then 
each one of the 45 possible samples will have equal probability of 
being the one selected. For each of the 45 samples, we can compute 
a sample mean. Each of these 45 sample means likewise has the 
same probability of occurrence. The distribution giving the proba¬ 
bility of occurrence of these sample means will represent the sam¬ 
pling distribution of x for samples of size 2 from the population. 
In Table 6.1 all possible sample combinations are listed together 
with their associated sample means. For example, the sample mean 
associated with the sample based on accounts B and D is 

_ 28 + 30 __ on 

* = — 2 — = 29 . 

The next step is to transform the information in Table 6.1 into 
a sampling distribution of x. This must show all possible values of 
x and their respective probabilities of occurrence. To this end, we 
may develop a table along the lines of Table 6.2. There, the 45 sam¬ 
ple means have been grouped into intervals. The number of sample 
means that fall into each of these intervals is listed, and, more im¬ 
portant, the probabilities of our selecting a mean in these intervals 
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are given. For example, there are 5 possible sample means in the 
interval 38-39.9 days. Since each has probability % B of being se¬ 
lected, the probability associated with that interval is % B) or .111. 
In other words, the probability of selecting a sample with a mean 
from 38 to 39.9 days is .111 (if we were to select a simple random 
sample of 2 from the population under consideration). 

Table 6 .7 

ALL POSSIBLE SAMPLE MEANS FOR SAMPLES OF SIZE 9 FROM 
= POPULATION OF TEN AGES OF ACCOUNTS RECEIVABLE M 


Accounts in 
Sample 


Sample 

Variates 


Sample 

Mean 

(x) 


Accounts in 
Sample 


• • (42, 28) 

35.0 

CJ. 

• • (42, 61) 

51.5 

DE. 

. (42, 30) 

36.0 

DF. 

. (42, 5) 

23.5 

DG. 

■ (42, 15) 

28.5 

DH. 

■ (42, 48) 

45.0 

DI. 

(42, 25) 

33.5 

DJ. 

. (42, 36) 

39.0 

EF... . 

• (42, 40) 

41.0 

EG.... 

. (28, 61) 

44.5 

EH. 

■ (28, 30) 

29.0 

El. 

• (28, 5) 

16.5 

EJ. 

. (28, 15) 

21.5 

FG. 

• (28, 48) 

38.0 

FII. 

• (28, 25) 

26.5 

FI. 

. (28, 36) 

32.0 

FJ. 

(28, 40) 

34.0 

GH. 

. (61,30) 

45.5 

GI. 

• (61, 5) 

33.0 

GJ. 

• (61, 15) 

38.0 

HI. 

• (61, 48) 

54.5 

HJ. 

(61, 25) 

43.0 

IJ. 

• (61, 36) 

48.5 



Sample 

Variates 

(61, 40) 
(30, 5) 
(30, 15) 
(30, 48) 
(30, 25) 
(30, 36) 
(30, 40) 

( 5, 15) 

( 5, 48) 

( 5, 25) 

( 5, 36) 

( 5, 40) 
(15, 48) 
(15, 25) 
(15, 36) 
(15, 40) 
(48, 25) 
(48, 36) 
(48, 40) 
(25, 36) 
(25, 40) 
(36, 40) 


Sample 

Mean 

(x) 


1,485.0 

33.0 


In short, Table 6.2 represents a sampling distribution of x. A 
c ea,rer mental picture of exactly what this sampling distribution 
ot x looks like may be gained from Figure 6.3. 

Remember that any sampling distribution refers to a specific 
statistic—this one refers to x, the sample mean. It also refers to a 
given sample size so far we have considered only n = 2. In addi- 
tion, the sampling distribution in Figure 6.3 portrays the pattern 
o variability of x under simple random sampling from the specific 
population of ages of accounts receivable under study. Vary any 
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one of these circumstances and we would obtain a different proba¬ 
bility distribution. 

Now that we have a pretty fair idea of what the sampling distri- 
bution of x refers to, let’s see what happens if we increase the 
size of the sample. Suppose we compute the sampling distributions 


Table 6.2 

SAMPLING DISTRIBUTION OF THE MEAN FOR 
SAMPLES OF SIZE 2 FROM POPULATION OF TEN AGES 
OF ACCOUNTS RECEIVABLE 


Sample Mean 
(in Days) 

Number of 
Possible 
Samples 

Probability 
of Occurrence 

10 119 . 

1 

.022 

12 13 9 . 

0 

.000 

14 15 9 . 

1 

.022 

1 R 17 Q . 

2 

.044 

Ig 19 9 . 

0 

.000 

90-9,1 Q . 

3 

.067 

99 91Q . 

3 

.067 

94 9*iQ . 

1 

.022 

96 27 9 . 

4 

.089 

98 9QQ . 

2 

.044 

90_31 9 . 

2 

.044 

99 33 Q . 

5 

.111 

34 35 9 . 

3 

.067 

36 37 9 . 

2 

.044 

qc 3Q q . 

5 

.111 

40 41 9 . 

1 

.022 

AO 43 Q . 

2 

.044 


4 

.089 

46 47 9 . 

0 

.000 

48 49 9 . 

1 

.022 

*±0 ♦.* • * 

SO 51 9 . 

2 

.044 

69 63 9 . 

0 

.000 

54-55.9. 

1 

.022 

Total. 

45 

1 .000* 


* Does not add to 1.000 because of rounding. 


of X for samples of size 3, 4, 5, 6, and 7. To do this we would follow 
the procedure used in the last section, but Table 6.3 spares you the 
details. It shows us the sampling distributions for n - 3, 4, 5, 6, and 
7 as well as for » = 2. Figure 6.4 portrays this analysis in terms of 
six simple graphs, one per sampling distribution. 

One can easily notice from this chart that as the sample size 
increases, the sampling distribution of x becomes less variable. In 
this connection, Table 6.3 also shows that as the size of the sample 
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increases, the scatter of sample means about the population mean 
decreases. Thus, for n = 2 the probability that a sample mean 
falls between 30 and 35.9 is approximately .22. For n — 3 this prob¬ 
ability increases to about .28. Finally, for n = 7 this probability 
rises to about .63. It increases progressively. In fact, no matter what 
interval about the population mean we select, the proportion of 
sample means within that interval increases as the sample size is 
increased. 


Figure 6.3 



One also can notice that as the sample size increases, the sam¬ 
pling distribution of the mean becomes more and more symmetri¬ 
cal. These tendencies are true not only for the sampling distribu¬ 
tion of x from the population of 10 ages but for sampling distribu¬ 
tions of x from almost any population. 

Thus, for example, in Figure 6.5 we present a series of 4 distribu¬ 
tions. The first figure represents a statistical population which, 
because it looks like a rectangle, is called a rectangular population. 
What might happen if we selected a simple random sample from 
this population? Well, if the sample size is 2, we can deduce (math¬ 
ematically) that the sampling distribution of x would look like a 
triangle—look at the second sketch in Figure 6.5. This represents 
the possible and probable variation in the sample mean—it is the 
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sampling distribution of x for simple random samples of 2 from a 
population that looks like the first sketch in Figure 6.5. Similarly, 
the two other curves represent the sampling distributions of x as¬ 
sociated with a rectangular population for n — 5 and 30. 

Notice, in particular, that this set of sampling distributions, even 
though they all are derived from a rectangular population, change 


Table 6.3 


SAMPLING DISTRIBUTIONS OF x FOR n = 2, 3, 4, 5, 6, 7 
ALL POSSIBLE RANDOM SAMPLES FROM POPULATION OF 
TEN AGES OF ACCOUNTS RECEIVABLE 


Sample Mean 
(in Days) 

Probability of Occurrence for Sample of Size 

2 

3 

4 

5 

6 

7 

10-11.9. 

.022 






12-13.9. 

.000 






14-15.9. 

.022 

.008 





16-17.9. 

.044 

.017 





18-19.9. 

.000 

.017 

.014 




20-21.9. 

.067 

.033 

.024 

.008 



22-23.9.. 

.067 

.058 

.033 

.028 

.010 


24-25.9.. 

.022 

.058 

.062 

.056 

.038 

.017 

26-27.9. 

.089 

.100 

.100 

.087 

.081 

.050 

28-29.9. 

.044 

.058 

.090 

.111 

.129 

.117 

30-31.9. 

.044 

.108 

.105 

.139 

.162 

.192 

32-33.9. 

.111 

.075 

.129 

.131 

.176 

.258 

34-35.9. 

.067 

.108 

.110 

.139 

.143 

.184 

36-37.9. 

.044 

.075 

.095 

.111 

.133 

.117 

38-39.9. 

.111 

.100 

.095 

.091 

.071 

.042 

40-41.9. 

.022 

.033 

.067 

.060 

.043 

.025 

42-43.9. 

.044 

.067 

.033 

.036 

.014 


44-45.9. 

.089 

.033 

.029 

.004 



46-47.9. 

.000 

.025 

.014 




48-49.9. 

.022 

.017 





50-51.9. 

.044 

.008 





52-53.9. 

.000 






54-55.9. 

.022 






Total*. 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 


* Do not add to 1.000 in some columns because of rounding. 


shape as the sample size increases—they get less variable and more 
and more resemble a normal curve. 

Figure 6.6 presents a U-shaped population, and Figure 6.7 a pop¬ 
ulation skewed to the right together with sampling distributions of 
x for n — 2, 5, and 30. Again the sampling distributions become 
less variable and more normal. 

Finally, Figure 6.8 represents a normal population distribution. 
It is not surprising, therefore, that all the sampling distributions 
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Figure 67 


Figure 6.8 
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from this population also are normal. Notice, however, that as n 
increases the sampling distribution of x becomes less variable. 

If you now compare the 4 sampling distributions for n = 30 in 
Figures 6.5 and 6.8, you will observe that they are remarkably alike 
even though they are derived from four different populations. This 
is indeed an amazing phenomenon. In fact, for most problems one 
meets in practice, no matter what the shape of a population, the 
sampling distribution of the mean is approximated by the normal 
curve, if n is sufficiently large. This means the normal distribution 
can be used to approximate the sampling distribution of x and, 
hence, to measure sampling risks. To use this result, however, we 
first must study the expected value and standard error of x. 


6.7 Characteristics of the Sampling Distribution of * 

As an aid in our discussion of the expected value of x and of the 
standard error (or standard deviation) of x, suppose that we con¬ 
sider all possible sample means that might be drawn from a simple 
population. Thus, suppose we have a population of the ages of 4 
typewriters—1,2,3, and 4 years. If 2 typewriters are to be selected at 
random from this population, there are 6 possible samples, namely: 


Sample 

( 1 , 2 ).. 

(1.3) . 

(1.4) . 

(2.3) . 

(2.4) . 

(3.4) . 


x 

1.5 
2.0 

2.5 

2.5 
3.0 

3.5 


Each of these samples is equally likely, and, hence, so are each of 
the 6 values of x. The sample mean is a random variable. Its ex¬ 
pected value is easily computed as follows: 


E(x) = g X 1.5 + g X 2.0 + g X 2.5 + g X 2.5 + g X 3.0 + g X.3.5 

15 

6 

= 2.5. 

This refers to the average value of x we would expect to obtain if 
we repeated our sampling procedure over and over again. 

Also notice that the mean of the population of 4 ages is 


1 + 2 + 3 + 4 


2.5. 


M = 


4 
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Thus, the expected value of the sample mean is equal to the mean 
of the population. This always will be true with simple random 
sampling (but not necessarily true with other types of sampling) 
Thus, we have 


E(x) — fi 

as a characteristic of simple random sampling. Of course, any one 
value of x may deviate from m— the fact that E(x) =,, only ex- 
presses a property of the sampling procedure. 

The second characteristic of the sampling distribution of x we 
shall consider is a measure of its variability. This, as noted, is called 
the standard error of x, or synonomously, the standard deviation 
of x. This measure is symbolized by o*. Generally speaking, it meas¬ 
ures variabihty in the possible values of the sample mean. Hence 
the smaller ^ the smaller is the variability in sample means and 
the greater the reliability of a sampling procedure. 

We now may note that 


<Jx 


<T 





N - n 

n ~ r 


The formula may be derived mathematically, but here we must be 
satisfied to check it out by a numerical example. 

Reconsider the population, 1, 2, 3, and 4 years. The 6 possible 
values of x, for samples of size 2, are 


1.5 , 2.0 , 2.5 , 2.5 , 3.0 , 3.5 . 

Their mean, E(x), is 2.5. In addition, their standard deviation 

is ' 




(1.5 — 2.5) 2 + (2.0 — 2.5) 2 -f 4- (3.5 — 2.5) 2 


6 


-\&Z 

= V.4167 
= .65. 


25 + 0 + 0 + .25 + 1 


We should obtain the same result for 
sidered earlier, 


^ by the formula we con- 


<Tx 




n 

1 
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To apply this formula we first must compute <r, the standard devia¬ 
tion of the population. For our simple example, 

In - 9 ! 5 V + (2- 2.5F + (3 - 2. 5) 2 + (4 - 2.5)* 

= \ 4 

= VL25 
= 1.12 . 

In addition, N = 4 and n — 2. Thus, we have 

_ 

" ~ V2 \4 - 1 

= .65 . 

Note that the result we obtain by the use of this formula is the 
same as the answer derived by taking the standard deviation of all 
possible sample means. The use of this formula for ^ enables us to 
calculate the value of the standard deviation of x without writing 
down all possible sample means. It is, so to speak, a short-cut 
formula. It also will serve to give us some insight into the meaning 


of o’ 

First let us see how our knowledge of the mean and the standard 
error of the sampling distribution of ^permits us to use the normal 
curve to calculate risks of relying on x. 


Example 6.10 A manufacturer of temperature control equipment 
plans to take an end-of-the-year inventory of his goods in process. 
The goods in process are stored in 500 barrels to facilitate the work. 
The barrels are assigned serial numbers. He plans to select a simple 
random sample of 100 barrels and to determine the average value 
per barrel. He will multiply this average by 500 to determine the 

value of the inventory. „ , . , , ,, 

Suppose that <x, the standard deviation of the inventory for the 
500 barrels, is estimated as $50. What is the probability that the aver¬ 
age value per barrel will be in error by $5.00 or more? _ 

We know that the sampling distribution of the mean is normally 
distributed, with E(x) = n. The probability in question, therefore, is 
represented by the shaded area in the diagram below: 



VALUE OF X 
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Note that the area in both tails of the normal curve must be consid¬ 
ered because the sample mean may understate or overstate the popu¬ 
lation mean. To determine areas under the normal curve we first 
compute the normal deviate 



Observe that in this problem we are not given the values of either x 
or /x. However, to determine the normal deviate we need only know 
the difference between the two. This difference has been specified as 


_ <7 }N — n 

** Vn N/n-1 
= 50 IdOO — 100 
VlOO \ 500 -1 
- 4.48 . 

Hence, 

Z 4.48 

= 1 . 12 . 

When 2 = 1.12, the normal curve indicates that the area in one tail is 
.1314. Hence, the area in both tails is .2628. In other words, the prob¬ 
ability of an error of $5.00 or more in the estimated average is .2628. 
This ordinarily would be a rather large risk to assume. For one thing 
you will recognize that an error of $5.00 in the average value per 
barrel represents an error of $2,500 in the aggregate value of all 500 
barrels. 

The formula that we have considered for <r a refers to the sam¬ 
pling distribution of x from populations which are finite. You will 
recognize the expression VF- n/N - 1 as the finite population 
correction factor (f.p.c.) used in connection with the formula for 
V As N gets larger, the f.p.c. gets closer and closer to 1. Thus, for 
an infinite population the formula for the standard error of x be¬ 
comes 


In fact, from a practical point of view, we can neglect the f.p.c. 
whenever the sample size is 10 per cent of the population size or 
less since in such instances the f.p.c. is sufficiently close to 1 so that 
it can be disregarded. 

Example 611 The Department of Weights and Measures of a 
large city instructs its inspectors to use the following rule in testing 
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scales: Weigh 4 standardized weights on the scale; if the average er¬ 
ror is more than 1 ounce, condemn the scale, otherwise approve it. 

Suppose that the standard deviation of the errors is .4 of an ounce. 
What would be the probability of approving a scale which on the av¬ 
erage is 1.5 ounces overweight? 

The sampling distribution of x is approximated by the normal 
curve with E(x) = 1.5 ounces. The probability in question is repre¬ 
sented by the shaded area in the diagram below: 



To determine this area we must find the normal deviate 


(Tx 

But, 

_ a 

<Tx /— 

v n 
_ _A 
" V4 
== .2 . 

Notice that we do not use the f.p.c. in this instance because the study 
is analytical—we are interested in the infinite population of errors 
associated with the weighing process. 

Thus, 

1.0 - 1.5 
* ~ .2 
= -2.50 . 

When ^ = 2.50, the normal curve indicates that the area in the tail is 
.0062. Thus, the probability of approving a scale, which on the aver¬ 
age is in error by 1.5 ounces above the true weight, is .0062 under 
the given decision rule. 

You will observe that the formula for o* indicates that its value 
depends on n, <r, and N (when the f.p.c. is used). Let us first con¬ 
sider the relationship between <r a and the sample size, n. We noticed 
in the last section that the sampling distribution of x became less 
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variable as we increased the samnle size Tf „ i u 

this variahiW it +„ u ,JT P 11 "* 18 to be a measure of 

is variability, it too should decrease with increasing „ a „ • , 

£E ,ot " - sh “ : s 


Population 


50 

200 

10,000 

Infinite 


Suppose t hat simple random samples of size 4 were selected from 
each of these populations. What would equal for each nf !h ”? 
sampling distributions? We calculate the values of” from the an 
propnate formulas. The results follow: * th P ‘ 


Population 


wires:™ rrs'rits ™ “ w in 
ssi oZ"sr m **“ the » f * -pS™ 

“ to Sa, ”I’ lto * Distributions of Deciabn.p^^, 

What about th, sampling distribution, of decision-parmetons?” 
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The answer is that neither x, ", nor any other decision-param¬ 
eter has a sampling distribution. We repeat, decision-parameters 

d °yo°; like P we are considering a 

random variable-a measure which ordinarily will vary from sam¬ 
ple to sample. The pattern of this variation is, m fact, exac y h t 
a sampling distribution portrays or summarizes. But m the case oi 
l a ™eter, we are dealing with a constant measure which s 
a property ^of the population under .tud,-a — «“■ is 
constant no matter what sample we select. repeat 

TVnm the answer to our question is simply 
it Le more-decision-parameters do not have sampling distribu- 

tions. 

6.9 You Should Now Know That 

The set of all values a statistic can assume together wlth their 
Kahilities of occurrence is called a sampling distribution. 

P ” »;C«S»ns provide a convenient basis io, compute 

^A 8 aiven sampling distribution always must be thought of as re- 

to a method of sampling, to a sample 

S1Z ThTsampling e distribution of the sample proportion with simple 
random sampling from an infinite population can be computed ex- 
SSX th7binLal formula. The sampling distribution of P de- 

Pe T d hemirof n the sampling distribution of p, called the expected 
value of p, and symbolized by E(p), equals x whether a populat 
* £ nr infinite if simple random sampling is used. 

“ iL., *e standard deviation o( this .am- 

pling distribution, called the sta ndard err or of p, is 


T * = yj 1 


(1 - t r) 


n 


if the population is infinite or very large in relation to the sample 
size, and 


[i r0~^) I; 

\ n N 


IN -n 
N - 1 


“ SeTS^Sln »< P becomes more and mom -normal” 

for larger and larger values of ti. 
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The normal curve often can be used to approximate the sam¬ 
pling distribution of p. 

The sampling distribution of x represents all possible values x 
can attain, together with the probability that each of these values 
will occur. It represents possible values of x for samples of a given 
size under a certain type of probability sampling from a specific 
population. 

The mean of the sampling distribution of x, called the expected 
value of x, equals the population haean under simple random sam¬ 
pling. Symbolically, E(x) '= p. 

Under the same conditions the standard deviation, or standard 
error, of x is 


CTx 



or 



depending on whether the population under consideration is infi- 
nite or finite. 

The sampling distribution of x generally becomes more and more 
“normal” as n is increased. 

The normal curve often can be used to approximate the sampling 
distribution of x. 

Standard errors, such as <r P and <r a , measure the variability of a 
statistic for a given sampling procedure. Hence, the smaller the 
standard error of a statistic, the smaller the risk in relying on sam¬ 
ple information. 


6.10 You Should Now Be Able to Solve 

1. A trade association selects a simple random sample of 100 firms 
from a membership of 510 to determine the average hourly wage for 
various job classifications. Will the sampling distribution of the 
average hourly wage be the same for all job classifications? Explain 
your answer. 

2. A simple random sample of 100 housewives is selected in order 
to estimate the proportion who favor a new cake mix. Does the 
binomial distribution describe the sampling distribution of the rele¬ 
vant statistic? Does the normal curve? Explain your answers. 

3. An accountant uses the following decision rule in auditing ac¬ 
counts receivable: Select a simple random sample of 2,000 entries 
from a total of more than 100,000 entries. If 3 per cent or more of 
the 2,000 entries are in error, check all the entries; if less than 3 per 
cent are in error, approve the accuracy of the ledger. 
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What is the probability that the accountant will approve the 
accuracy of the receivables when actually 5 per cent of the entries 
are in error? What is the probability that he will not approve them 
on the basis of the sample if 2 y 2 per cent of the entries are in error? 

4. Explain the following statement: The standard error of a statis¬ 
tic is a measure of the risk in making a decision on the basis of 
sample information. 

5. Until recently it often was a general practice in industry to se¬ 
lect a 10 per cent sample from an incoming lot of material to deter¬ 
mine whether to accept or not to accept the lot. Evaluate the 
soundness of this procedure. 

6. Unemployment, as measured by the proportion of the labor 
force unemployed, is known to be much higher in one state than 
another. The proportion of unemployed is to be estimated by a 
sample survey in each state. It is desired to have the same sampling 
error, as measured by the standard error of p, for both states. Would 
you or would you not suggest the same size sample for both states? 
Explain your answer. 

7. A firm conducts a sample study to determine the average time 
taken to engrave calibrations on control panel dials. Twenty-five 
observations are to be made, and the value of x used as an estimate 
of /*. On the basis of this estimate, the cost of the engraving opera¬ 
tion for an order of 10,000 dials will be estimated. If it is assumed 
that the standard deviation of the times is 10 minutes, what is the 
probability that the value of x overstates n by 5 minutes or more? 
What is the probability that the value of x understates p by 5 min¬ 
utes or more? What is the probability of an error of 5 minutes or 
more in either direction? 

8. A company manufactures cylinders that average 2 inches in di¬ 
ameter. The standard deviation of the diameters of the cylinders is 
.10 inches. The diameters of a sample of 4 cylinders are measured 
every hour. The sample average is used to decide whether or not 
the manufacturing process is operating satisfactorily. The follow¬ 
ing decision rule is applied: If the average diameter for the sample 
of 4 cylinders is equal to 2.15 inches or more, or equal to 1.85 inches 
or less, stop the process. If the average diameter is more than 
1.85 inches and less than 2.15 inches, leave the process alone. 

What is the probability of stopping the process if the process 
average, /*, remains at 2.00 inches? 

What is the probability of stopping the process if the process 
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average were to shift to ^ = 2.10 inches? To y = 1.90 inches? To 
y == 2.05 inches? 

What is the probability of leaving the process alone if the process 
average were to shift to /* = 2.15 inches? To y = 1.85 inches? To 
p = 2.30 inches? To y = 1.90 inches? 

6.11 You Will Also Find That 

For review and/or further information on the sampling distribu¬ 
tion of p and on the sampling distribution of x, see: 

Hirsch, Werner Z. Introduction to Modern Statistics, pp. 111-23. 
140-51. New York: MacMillan Co., 1957. 

Lewis, E. E. Methods of Statistical Analysis in Economics and Business 
pp. 195-201, 256-63, 270-78. Boston: Houghton Mifflin Co., 1953. 
Rosander, A. C. Elementary Principles of Statistics, pp. 189-204, 240-46. 
New York: D. Van Nostrand & Co., Inc,, 1951. 

Wa lli s, W. Allen, and Roberts, Harry V. Statistics: A New Approach 
pp. 345-81. Glencoe, Ill.: The Free Press, 1956. ’ 



CHAPTER ■ 7 


Risk and Uncertainty in 
Estimation Problems 


7.1 Estimation Decision Problems 

In Chapter 1 and at several subsequent points we have distin¬ 
guished between estimation and testing problems on the basis of 
whether a decision problem involves numerical or nonnumerical 
courses of action. One reason that this distinction is important, as 
you will see, rests with the fact that the risks of incorrect decisions 
may be measured differently for the two types of problems. Further¬ 
more, in planning a sample study, some differences in the statement 
of objectives also are involved, depending on whether a study in¬ 
volves estimation or testing. 

Thus, in this chapter, we shall consider some theory, methods, and 
applications dealing with estimation decision problems. The next 
chapter will consider similar aspects of testing decision problems. 

Generally speaking, a statistical estimation problem involves se¬ 
lecting an estimate which approximates the value of a decision-pa¬ 
rameter. Suppose that we refresh and extend our recollection of esti¬ 
mation decisions by way of some illustrations. 

Example 7.1 A not uncommon estimation decision problem arises 
whenever an individual must purchase a pane of glass to replace a 
broken window. Thus, one must decide on the length and on the 
width of the pane to be purchased. Both of these values may be esti¬ 
mated on the basis of observations—measurements of the window 
frame. Hence, you can see the problem is statistical—particularly if 
you recognize that repeated measurements of the length (as well as 
the width) would, to some extent, vary and that two or three or even 
one dozen measurements on the same window would represent only a 
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sample of measurements. Thus, the choice of a course of action must 
depend on a sample of observations. 

Example 7.2 An interesting estimation decision problem in the 
field of accounting arises in the settlement of inter-airline accounts. 1 

The problem arises because of situations such as the following. Sup¬ 
pose a passenger wishes to travel from Portland, Oregon, to Seattle 
Chicago, and. New York City. If he purchases a complete ticket in 
Portland, United Air Lines will collect the entire fare ($158.85) for 
the travel indicated. However, UAL will carry the passenger only 
from Portland to Seattle. Northwest Air Lines would provide trans¬ 
portation from Seattle to Chicago, and Trans-World Air Lines from 
Chicago to New York. 

Thus, the problem centers on estimating the amount of the total 
fares UAL owes NWA and TWA—not for one ticket but perhaps for 
20,000 such tickets sold during a given month. Alternatives are repre¬ 
sented by the various estimates of revenue to be paid to NWA and 
TWA. United Air Lines, and other companies as well, have devel¬ 
oped statistical decision rules based on sample information to decide 
upon .the choice of alternatives and, hence, upon the settlement of 
interline accounts. 

In the two preceding illustrations, a course of action would fol¬ 
low directly from an estimate of the decision-parameter. In many 
other business decisions, however, it is practically impossible to 
formulate one's problem so that action will flow from such estimates 
alone. Other factors, some qualitative, must be taken into account 
together with estimates of several decision-parameters in order to 
decide on a course of action. 

Example 7.3 One management textbook 2 lists the following as cri¬ 
teria which should.be considered in the location of a business: 

(a), proximity to desired markets, (6) proximity to supply of raw ma¬ 
terials, (c) availability of labor, ( d ) adequacy and cost of transporta¬ 
tion, (e) adequacy and cost of power and fuel, (/) proximity to firms 
in similar fields, (g) suitability of climate, ( h ) character of the com¬ 
munity and special inducements, (i) reasonableness of state and 
local taxes, and (;) suitability to decentralization. 

Some of these criteria, such as the character of the community, 
must be evaluated qualitatively in making a decision on where to lo¬ 
cate. However, others, such as the average temperature, rainfall, and 
other weather factors bearing on the suitability of climate, may be 
evaluated quantitatively and estimates of these quantities derived 

1 Wmston C. Dalleck, “Inductive Accounting: Settling Interline Accounts by 
Sampling Methods/’ Industrial Quality Control, Vol. XIII/No. 6 (December, 1956), 

pp. 12—16. ’ 

2 John A. Shubin, Business Management: Efficiency Surveys and Systems (New 
York: Barnes & Noble, Inc., 1954). 
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on the basis of statistical studies. Thus, some of the bases for deci¬ 
sion are provided by estimates of decision-parameters. 

Problems of this type are not wholly statistical decision prob¬ 
lems. They merely contain some statistical aspects. The statistical 
part of such a problem is to estimate pertinent decision-parameters, 
knowledge of which would contribute to the basis of a managerial 
decision. Of course, reliable estimates are important even though 
the decision problem is not wholly statistical. 

Thus, we see that statistical estimation is important in some 
problems as a direct basis for decision; and in others as a means of 
contributing to a basis for decision. Often, of course, the way in 
which a problem is formulated governs the way in which such esti¬ 
mates will be used—directly or indirectly. In any event, estimates 
must be made to serve a useful function if they are to be worth their 
cost. Hence, one always must plan, in a general way, how he will use 
them. In addition, to further motivate our study of estimation, we 
may note that unless estimates are computed by reliable methods, 
they will be unable to serve a useful function. 

The above illustrations imply the necessity of primary studies— 
studies geared to the particular problem at hand. It also may be 
noted that many secondary studies involve estimation. That is, 
from the point of view of the producer of such data, the purpose of 
his studies is to provide information for the use of consumers of the 
data. This information ordinarily may be provided most conven¬ 
iently in the form of estimates of proportions or averages or aggre¬ 
gates or whatever else is of interest. 

Example 7 A A few years ago Life magazine sponsored an exten¬ 
sive and comprehensive statistical study of consumers’ buying habits 
in the American market. The study, based on a probability sample, 
was conducted as a public service to provide quantitative informa¬ 
tion as a background for the marketing decisions of individual busi¬ 
nesses. The survey was conducted for Life under the auspices of the 
Alfred Politz Research, Inc. 

Among the hundreds of estimates prepared as a result of the study 
were estimates of average annual household expenditures for various 
goods and services by families with different annual incomes, esti¬ 
mates of the proportions of income spent by families on different 
goods and services, estimates of the frequency distribution of house¬ 
holds by annual income, etc. Thus, information of interest to those 
who make marketing decisions was presented clearly and concisely in 
terms of estimates of decision-parameters. 

As has been already noted, estimation problems involve select¬ 
ing, on the basis of sample information, estimates which approxi- 
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mate the values of decision-parameters. Of course, we want, in 
these problems, to learn how to derive good approximations—that 
is, to use sample information most efficiently. Thus, our main task 
in this chapter is to learn what rules one should follow in this en¬ 
deavor. For example, it may seem reasonable to use the value of a 
sample statistic (such as x) as an approximation to a population 
parameter ( such as /x)—but this may be a risky business. The ques¬ 
tion that arises, therefore, is: “How can we appraise this decision 
procedure?” 

In discussing estimation it is customary to distinguish two types 
of estimates—generally called (a) point estimates, and (b) interval 
estimates. The difference between (a) and (b) is quite simple and 
may be pointed up by considering an example. 

Example 7.5 A recent sample survey of assessed valuations in the 
United States reports estimates of the per cent of total assessed 
value that consists of “residential one-family houses.” The estimates 
are reported both as point estimates and interval estimates. Thus, 
for the state of Virginia estimates are as follows: 

Per Cent of 

Type of Estimate Total Assessed Value 

Point. 470 

Interval .46^ to 48.9 

As you can see, the point estimate is a single value which attempts 
to “hit” the true but unknown population ratio. On the other hand, 
the interval estimate is a range of values which it is hoped includes 
the true but unknown population ratio. 

As this illustration implies, a point estimate is a single estimate 
of the value of a decision-parameter, while an interval estimate 
takes the form of a range of estimates. Making a point estimate is 
like throwing a dart at some point on a game board, while making 
an interval estimate is analogous to tossing a hoop at the point and 
trying to encircle it. Of course, whether one throws a dart or a hoop 
depends on the game he is playing. Similarly, whether one makes a 
point estimate or an interval estimate depends on his decision 
problem. For the present we shall center attention on point estima¬ 
tion. In Section 7.6 we shall consider procedures for constructing 
interval estimates. 

Let's consider one other introductory thought about estimation 
decision problems. This involves an answer to the question: “What 
form does a statistical decision rule assume in such a problem?” As 
we know, a statistical decision rule relates actions to possible obser¬ 
vations before a study is carried out. It is a complete plan on how to 
sample and on how to use the sample obtained. Furthermore, it is 
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important to remember it is formulated a priori. These are genera 
ideas. To begin being more specific about the form of such a rule m 
an estimation problem, suppose we consider another illustration. 


Example 7.6 A plastics firm must decide on a piecework wage rate 
for a buffing and polishing operation. The average time for this oper¬ 
ation is an important factor in determining the rate. Since the popu¬ 
lation of observations is infinite because the study is analytical, a 
sample study is planned to estimate the mean time for the operation. 
The choice of a wage rate will be based on this estimate. Hence the 
time-and-motion-study man suggests the following statistical deci¬ 
sion rule- Select a simple random sample of 20 and use the value of 
the sample mean, 5, to estimate the population mean, The proce¬ 
dure indicated obviously involves statistical estimation. 


As this example implies, there are at least three integral parts to 
a statistical decision rule in an estimation problem. 

One is the instruction on what method of sample selection to 

use; e.g., simple random sampling. 

The second is the instruction on how large a sample to select; 
e.g., 20 observations on the operation. 

The third is a rule or formula for translating the sample obser¬ 
vations into an estimate of the decision-parameter of interest; e.g., 
use the value of x as an estimate of i*. This rule or formula is called 

an estimator. _ 

All three of these integral parts of a decision rule should be for¬ 
mulated in the planning stage of a study. However, they should not 
be formulated haphazardly. There are certain principles that must 
be followed. In this chapter, we shall consider the problem of formu¬ 
lating decision rules for estimation problems and how a decision 
maker can rationally answer such questions as: “What formulas 
(estimators) should we use to construct estimates?” “When should 
simple random sampling be preferred to other types of probability 
sampling?” “How do we determine sample size?” 


7.2 Estimators 

As was indicated in the last section, one integral part of a statis¬ 
tical decision rule in an estimation problem is the estimator—the 
formula which translates sample observations into an estimate of a 
decision-parameter. 

Among the simplest and most reasonable estimators are: (l) x, 
the sample mean, as an estimator for p, the population mean, and 
(2) p the sample proportion as an estimator for w, the population 
proportion. Although these estimators undoubtedly seem reason- 
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able to you, it probably is difficult for you to explain why they do. 
What does “reasonable” mean in this connection? Think it over. 

In any event, our present task is to give “reasonableness” a more 
precise statistical meaning. We must consider certain criteria for 
evaluating estimators for sorting out reasonable estimators from 
unreasonable estimators. 

Note, first of all, that we cannot always select the corresponding 
statistic as an estimator for a decision-parameter. For example, the 
sample aggregate would be a poor estimator for a population aggre¬ 
gate. If you don t think so, consider the following problem: Sup¬ 
pose you wanted to estimate the total amount of change in the 
pockets of all your fellow students (e.g., at next Monday’s class). 
Would the aggregate in a sample of two pockets be a reasonable 
estimate of the aggregate amount of change in everyone’s pockets? 
You may reason out that a better estimator for a population aggre¬ 
gate is Nx. Why? 

In our discussion of criteria for evaluating estimators the follow¬ 
ing notation should be kept in mind. First of all, w, ^ A, etc., as 
always, will represent decision-parameters—population properties, 
the values of which we desire to estimate. Secondly, an estimate of 
any decision-parameter will be symbolized by a caret over or /* or 
A, etc. That is, estimates will be symbolized by £, /t, A, etc. 

Thus, when we express any estimator formula-wise, we should 
write, for example, 

* = V, 

which says “estimate the value of * by using the value of the sam¬ 
ple proportion.” Similarly, other estimators we already have dis- 
cussed are, in this notation, 

JU = x 

and 


A = Nx. 

Now, with this notation in mind, let’s consider three criteria 
which statisticians often use to evaluate the reasonableness of esti¬ 
mators. These criteria or properties are called consistency, unbiased¬ 
ness, and efficiency. 

Roughly speaking, an estimator is consistent if, as the sample 
size is increased, the probability that an estimate will be approxi¬ 
mately equal to the decision-parameter tends closer and closer to 
one. To be more precise—in the case of a finite population—we may 
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say an estimator is consistent if an estimate becomes exactly equal 
to the decision-parameter when n — N. 

Example 7.7 A mailing list service sells lists of names for use in 
advertising and promotional campaigns. Some lists include persons 
in specified income brackets, others include only doctors or only law¬ 
yers, and still others include persons in given geographical areas. 
Suppose that the service must decide on a price to charge for two such 
lists. To do so they must know the number of persons common to 
both lists. 

Assume that one list contains N names, the other M names, and 
simple random samples of n and m names, respectively, are selected 
from these lists. If there are d duplicates between the samples, we 
may estimate the number of duplicates, D, in both complete lists by 
the estimator 


This estimator has the property of being consistent. Note that if 
n — N and m = M, d must equal D because a complete count is being 
taken. In such a case our estimator becomes D = D. Any estimate is 
exactly equal to the decision-parameter, D. 

For an illustration of the use of this estimator, suppose that we 
have two lists with 10,000 and 5,000 names, respectively. If a sample 
of 400 names is selected from each list and 7 duplicates observed, we 
would estimate D as 


n _ (10>0QQ) (5,000) m 
U ~ (400) (400) v } 

= 2,188 . 

Consistency, or the lack of it, may be interpreted in terms of a 
sampling distribution. Thus, we know that the sampling distribu¬ 
tion of x and the sampling distribution of p tend to center more 
and more closely about ^ and it, respectively, as u is increased. 
Hence, the estimators ^ — p and £ = x are consistent. So is the 
estimator A = Nx a consistent estimator. 

Consistency is a desirable property. If an estimator is consistent, 
this means that the larger the sample, the greater the reliability of 
one’s estimate. However, you will note that the criterion of consist¬ 
ency refers to a property of the sampling procedure. It refers to 
what results an estimator would yield if the sample size were in¬ 
creased. Of course, the consistency of an estimator does not guar¬ 
antee a good estimate in a single sample. Consistency is not a prop¬ 
erty of any one estimate. It simply is a property of an estimating 
procedure. This also is true of all other criteria for estimators. Re- 
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member no one can determine whether or not a single sample or a 
single estimate will be good or not. 

. Arl estimator is unbiased if its expected value equals the deci¬ 
sion-parameter being estimated. For example, * = p is unbiased if 
P is the result of simple random sampling. Why? With simple ran¬ 
dom sampling, E(p) = 7t , Similarly, with simple random sampling 
p — x and A = Nx are unbiased because 

E{x) = ft, and E(Nx) = Np ~ A . 

Example 7.8 One simple example of an estimator which is biased is 

a 2 = s 2 . 

That is, the sample variance is not an unbiased estimator of the pop¬ 
ulation variance. The expected value of s 2 does not equal CT 2 Instead 
it can be demonstrated mathematically that for a finite population 

* (s2) = A^7-~r^- 

For an infinite population this becomes 

E(s 2 ) = 0 - 2 1 

n 

(These formulas indicate that on the average the value of a sample 
variance is smaller than that of its population’s variance. Why?) 
Statisticians often prefer y ; 


V = 


N - 1 
N 


n 


n 


as an estimator for the variance of a finite population and 


n — 1 


as an estimator for the variance of an infinite population The ex- 
pected values of these estimators equal ff 2 , and, hence, they are un- 

As a numerical example, suppose that we were to select a simple 
random sample of 4 ball bearings from a lot of 10,000 and found the 
variance of their diameters, s 2 , to be equal to .0012. If we ignore the 

variance's " ^ “ d ° Se t0 h an estimate of the population 

v 2 = | (.0012) 

= .0016 . 

nrndunZ! e 7 ' 9 pr ^ Iems w !> ich deal with the statistical control of 
production processes the sample range often is used to estimate the 

population (process) standard deviation. The great advantage of 
using the range is the ease and simplicity of its calculation S 
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For samples of a given size which are drawn from a normal popu¬ 
lation, statisticians have determined the relationship between the ex¬ 
pected value of the sample range and the standard deviation of the 
population. This relationship which is designated by the symbo , 2, 

may be written as: 

, E(R) 
d 2 = —- 

or, alternately, 

(I2 

E(R) is the expected value of the sample range, and d 2 is a constant 
which depends on the sample size. Some values of d 2 follow. 

Sample Size 

9 .1-128 

* . .2.059 

g..2.326 

In order to estimate «r, we, therefore, may use the range of one 
sample. However, rather than use the range of a single sample, the 
average range for a group of samples of the same size usually is use 
to estimate <r. That is, the unbiased estimator 

* _ R 

a ~ di 

is used. Here, R is the average of several sample ranges. 

Consider this numerical example of the use of the estimator. S p- 
pose that we have 5 samples, each a sample of 4 diameters of steel 
rods. The ranges for the 5 samples are (m inches) .011, -003, ,UU«, 
.023, .014. 

.011 + .003 + .009 + .0 23 + .014 
R = ' 5 

= .012 inches. 

For samples of size 4, <f 2 = 2.059. Therefore, an estimate of the popu¬ 
lation standard deviation is 

■012 
ff 2.059 
= .0058 inches. 

Thus, unbiasedness is a long-run property of an estimator. It 
means that if the estimating procedure were repeated over and oyer 
again many times, on the average the estimator would yield the 
right answer. Thus, with simple random sampling, x is an unbiased 
estimator of v because if we used x over and over again to estimate 
a on the average our estimate would exactly equal 
’ Unbiased estimators seem reasonable, if for no other reason 
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than that they appeal to one’s sense of fairness. However, it must 
be remembered that unbiasedness simply means that the estimat¬ 
ing process is headed in the right direction. Any one estimate of 
course, may be in error. Therefore, we also must consider the mag¬ 
nitude and probabilities of possible errors in evaluating estimators. 
We must consider the potential variability of an estimating proce¬ 
dure. This leads to the criterion of efficiency . 

As you know we may measure the variability in a sampling 
procedure and, hence, in an estimating procedure by the standard 
error or by the variance of a statistic. For example, the variance of 
x for simple random sampling from a finite population is 


This measures the variability in the procedure of using x as an 
estimator of 

Efficiency is a relative criterion for evaluating estimators. That 
is, it involves the comparison of two estimators by comparing their 
variances. Thus, one estimator is more efficient than another if its 
variance is smaller. Note, then, that more efficient means more pre¬ 
cise and less variable. 

Example 7.10 Suppose that we wish to estimate the arithmetic 
mea,n of a population that may be assumed to be normally distrib- 
utecl. One estimator is, of course, 

£ = x , 

which we know to be unbiased (for any type of population). In addi¬ 
tion, m a normal population the mean and the median are equal, 
thus we might use the sample median as an estimator for ^ Fortu¬ 
nately, this estimator also is unbiased and at times quite useful. 

However, the sample mean is a more efficient estimator of a than 
the sample median; that is, it has a smaller standard error. As we 
know, the standard error of x is 

(for an infinite population). In contrast, the standard error of the 
sample median, for large samples from a normal population is 
equal to } 

1.2533a-* 

and, hence, is obviously larger than 

Example 7.11 Unbiased estimators are not necessarily preferable 
to biased estimators. At times the latter are more efficient. For exam¬ 
ple, a commonly used, though biased procedure, is called ratio esti- 
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mation. To illustrate this method in a simple situation assume that 
we wish to estimate the average retail sales in 1959 for 3 stores. 
Suppose that this population includes the following observations: 

Store: ABC 

1959 Sales: $2,000 $4,000 $12,000 


Hence, p = $6,000. , , 0 

If a simple random sample of 2 stores is to be selected, there are 6 

possible samples and 3 possible estimates of p, 


Sample 
A,B. . 

A, C. . 

B, C.. 


Estimate of fi 
$3,000 
7,000 
8,000 


The average (expected value) of these estimates is $6,000, which 
once again demonstrates that the mean of a simple random sample 
is an unbiased estimator of p. However, you will note that the possi¬ 
ble estimates are rather variable—the standard error, in fact, is $2,- 


In contrast to simple random sampling, we might estimate retail 
sales for 1959 by a ratio estimate. Thus, suppose that retail sales for 
1958 also were available as follows: 


Store: A jj ^ 

1958 Sales: $1,000 $4,000 $10,000 

This information can be used in estimating sales for 1959. Thus, sup¬ 
pose that a sample of 2 stores, A and C for example, were selected. 
The average sales for these stores in 1958 were $5,500; those in 1959 
were $7,000. The ratio of average sales in 1959 to those in 1958 for 
the sample is 


7,000 

5,500 


1.2727 . 


If we apply this ratio to the average sales in all stores in 1958, we 
would obtain our ratio estimate 


(1.2727) ($5,000) = $6,364 . 


Again considering all possible samples of size 2, we observe that 
there are 3 possible ratio estimates, namely, 


Sample Ratio Estimate of yx 

A,B. $6,000 

. * A OdA 


Hence, the average (expected value) of these estimates is $6,026 
which is not equal to the population mean. The ratio estimating pro¬ 
cedure is biased—though the bias is slight. However, note that the 
possible estimates are relatively stable—more so than with the un¬ 
biased estimator, x. 

Ratio estimates often are used in marketing research and economic 
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surveys,_ and they are often preferable to simple estimators despite 
the possibility of some bias because they often are far more efficient. 
Of course, auxiliary information (e.g., 1958 sales) must be available 
to use them. 

Several other criteria besides those of consistency, unbiasedness, 
and efficiency have been suggested by statisticians for deciding upon 
estimators. One, which reflects rather recent and important develop¬ 
ments in modern statistical theory, is called the wiiTviinax principle. 

Example 7.12 A good part of the modern developments in the 
field of statistics centers on the matter of how to consider conse¬ 
quences more explicitly in a decision problem. For example, we might 
assume that the loss resulting from a poor estimate of 7 r is propor¬ 
tional to the square of the difference between tt and our estimate; 
that is, proportional to 

(ir — 7r ) 2 . 

Thus, if we were to estimate tt as .5 when it is .4, our loss would be 
proportional to . 01 ; while if we were to estimate 7 r as .9 when it was 
. 6 , our loss would be proportional to .09. 

For different estimates (our alternatives) and for different values 
of tt (states of the world) there are different losses. Suppose, for fur¬ 
ther numerical examples, that we were sampling from a population 
of 3 dwelling units which may be occupied or not occupied. The 
population proportion of occupied dwelling units, tt, might equal 0, 

34, or 1 . Obviously, among our possible alternatives, estimates of 
tt, are 0 , %, %, and 1 . However, we need not limit ourselves to these 
estimates—we may also consider %,%,%, and any other values as 
alternatives. The following table presents the consequences as meas¬ 
ured by the loss function 

(tt _ 

for various states of the world and different possible estimates: 


Alternative 

Estimate 

State of the World 

II 

0 


* “ Vs 

7T — 1 

7 r — 0. 

.000 

.111 

.444 

1.000 

# = K- .. 

.028 

.028 

.250 

.694 

* = M. 

.111 

.000 

.111 

.444 

TT ~ f/2 ... 

.250 

.028 

.028 

.250 

^ = % . 

.444 

.111 

.000 

.111 

* = H . 

.694 

.250 

.028 

.028 

7T —■ 1 . 

1.000 

.444 

.111 

.000 


Now note that if we are to choose an alternative (estimate) on the 
basis of a random sample, the consequences also will depend on 
chance. We then may compute the expected loss of our estimating 
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procedure. Thus, suppose that we were to select a sample of 2 from 
the population of 3 dwelling units and use the estimator ft = p as a 
basis for the choice of an estimate. There would be 3 possible sam¬ 
ples of size 2 that we may draw from our population of 3 dwelling 
units. If x = 0, the proportion of occupied dwelling units, p, in each 
of the 3 possible samples of 2 is obviously 0. Hence, the probability 
that £ = 0 is 1, and our expected loss is 0. However, if tt = %, one 
of the possible samples of 2 would have no occupied dwelling units 
or p = 0; 2 of the samples would have 1 occupied dwelling unit, or 
p = y 2 . Hence, the probability that 

it — p — .00 is , 

while the probability that 

ft — V = M is % * 

The expected loss of this estimator is, thus, 

.111 X H + -028 X%= .056 . 

Similarly calculations show that for the estimator ft = p, the expected 
losses are as follows: 


State of the World 


7T — 0 

II 

TT = % 

IT — 1 

.000 

.056 ! 

.056 

.000 


By what criterion should we choose an estimator? One answer in¬ 
volves considering the maximum expected loss over all states of the 
world (e.g., the maximum for ft = p is .056), and selecting that esti¬ 
mator which minimizes this maximum loss. Such an estimator may 
be derived mathematically and is called a minimax estimator of tt 
because it minimizes one’s maximum loss. 

With simple random sampling from a finite population it may be 
shown mathematically that this estimator is 

. _ \4p + Vzv'jN - n)/(N - f) t 
’’ Vn+ V(N-n)/(N~ 1) 

while for an infinite population it is 

A Vnp + Y<i 

Vi + 1 


Applying the estimator for a finite population to a situation for 
which N — 3 and n = 2, we would note that if p — 0, 


A 

TT 


V2(0) Hr i/ 


while if p = y 2 our estimate would be ft = %, and if p — 1 our esti¬ 
mate would be %. You might verify that for each state of the world 
the expected loss of this estimator is only .028. Hence, the maximum 
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loss for the minimax estimator is .028, while the maximum loss for 
the unbiased estimator is .056. In fact, if we were to“pute tee 
maximum loss for any other estimator of we still would find that 
the maximum loss for the minimax estimator is smaller—hence its 
name, rmmmax estimator. ence lts 

Unfortunately, the use of a minimax estimator or any other similar 

becLfeonSifficuVv f PreS6nt State ° f ° Ur knowle ^ e > * limited 

decision proWemf 7 meaSUmg ° 0nse( l uenoes explicitly in most 

°ur brief discussion of estimators in this section has indicated 
some of the criteria statisticians use to decide between reasonable 
and unreasonable estimators. We considered such criteria as con- 
s ency unbiasedness, efficiency, and the minimax principle Al¬ 
though there is far more that might be said about these matters we 
have covered enough ground to realize that the use of such estima¬ 
tors as ir - p, /* = X, and A = Nx is far from arbitrary. 

7.3 Choice of a Method of Sampling* 

In . ^°™ ulatln g a decision rule for an estimation problem one 
n deClde ° n a “ ethod of Probability sampling at the same’time 
the choice of an estimator is being made. In fact, an estimator such 
as /i a: has no meaning unless the method of selecting the sample 

upon which x is to be based, is specified. In the preceding section 
e considered only simple random sampling as a basis for the esti¬ 
mators discussed. Now let us consider other methods of sampling 
as well, and an answer to the question, “For a given study, how 
does one decide on the method of probability sampling to be used?” 
For example, how should one choose from among simple random 
sampling, cluster sampling, stratified sampling, and sequential 
sampling m a marketing research study? 

Generally speaking, our aim in any estimation decision problem 
is to choose the method of sample selection which will yield the 

smallest risks for a given cost or which will cost the least to achieve 
a given set of risks. 

We know that any method of sampling is characterized by some 
possible variation m a sample result—and that a useful measure of 
this variability (when the sampling distribution is normal) is the 
standard error. Hence, in a given situation to compare methods of 
sample selection, we may compare the standard errors that are as¬ 
sociated with those methods and select the one with the smallest 
standard error. 

In order to discuss this topic more completely we would need to 
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use formulas for standard errors of x and of p, etc., for several meth¬ 
ods of sample selection. However, we have considered such standard 
™ formulas only for simple random sampling. Thus, m th.s pmt 
of our study we must rely on specific numerical examples to indicate 
general points of interest. 

Example 7.IS During World War II, the administration of tire 
rationing*by the government required current and reliable informa¬ 
tion on inventories held by dealers, distributors, and manufacturers 
For reasons of economy and expeditiousness this of 

most always determined by a sample study. That if),■ a 
dealers, distributors, and manufacturers was selected, and the coun 

trv’s inventory estimated on that basis. ,. 

‘ The particular method of sampling often used m these irtuidles was 
stratified sampling. For example, let us consider the March, 1945, 
survey Then dealers were classified into classes or strata according 
to the number of tires they held in September, 1944, a date for which 
completelnformation was available. Thus, one group included deal¬ 
ers who held from 1 to 9 passenger tires on September, 1944b £mot e 
included dealers who held 10-19 passenger tires, etc. A simple ran 
dom sample was drawn from each stratum, and an estimate of the ag 
gregate inventory prepared (by summing the estimates of the aggre- 
gates for each of the strata). 

Stratification increased the efficiency of the sampling plan to a 
great extent. It was estimated that if a simple random sample had 
been selected, 2% times as many dealers would have been necessary 
in order to achieve the same standard error as was achieved with t 

St i“:?bec.u, e the strata inf. which M« were grouped 
were fairly homogeneous with regard to the characteristic under 
consideration. In addition, the number of tires per dealer m one 
stratum differed greatly from the number of tires per dealer m the 
other strata. In general, the aim m stratification is to select strata 
which have little variability within each stratum, but for w bich the 
variability among the strata is large. When these aims are fulfi e , 
stratified sampling involves less risk of error than simple random 

sampling for the same size sample. 

Stratified sampling, however, will as a rule be more costly i : an 
simple random sampling. In the case of stratified sampling there 
must be information which will enable us to divide the over-all 
frame into lists for each stratum. In the tire inventory study, data 

' Fdwarch Deming and Willard A. Simmons, “On the Design of a Sample 

for D^le^s Inventories, “ of the American Statistical Association, Vol. XL 

(1946), pp. 16-33. 
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on tire inventories for September, 1944, were available and, hence, 
the cost of constructing lists for each stratum could be kept rela¬ 
tively small. Cost factors must be considered, for what we desire to 
attain is increased precision per dollar spent. We must be sure, 
therefore, that added gains in precision are not whittled away by 
very large increases in cost. 

Example 7.14 An automobile parts manufacturer, in order to 
make certain pricing decisions, needs information about the number 
of competing brands carried in inventory by gasoline stations who 
also sell the manufacturer’s products. A marketing research firm 
hired to conduct the study considers two possible methods of sample 
selection. One is the selection of a simple random sample: the other is 
cluster sampling. 

Thus, on one hand, a sample of gasoline stations might be drawn 
one at a time and at random from a list of the firm’s customers. On 
the other hand, clusters of 5 neighboring stations might be formed 
and a sample of these clusters drawn at random. 

The main advantage of cluster sampling is a reduction in the costs 
of travel which is necessary to visit stations in order to obtain in¬ 
formation. Thus, for this study it is determined that 60 clusters of 
5 stations can be selected for the same cost as a simple random sam¬ 
ple of 100 stations. 

However, it is recognized that this larger sample size may not 
mean greater precision in the estimates of the decision-parameters of 
interest. Neighboring stations undoubtedly will be similar with re¬ 
spect to the characteristic of interest. Hence, a cluster of 5 stations 
cannot be expected to provide as much different information about a 
population as 5 separately selected elementary units. Although we 
shall not compare their formulas, it may be noted that this phenom¬ 
enon will reflect itself in the standard errors of the estimates. Thus 
suppose that the marketing research firm estimates that for a simple 
random sample of 100 gas stations the variance of the proportion of 
firms which do not carry competing products would be about .0016. 
However, they also estimate that for a cluster sample of 300 gas sta- 
tions the variance of p would be approximately .0012. Note that 
while the cluster sample does not cut the variance of p to one third 
of its value for simple random sampling, it does result in a somewhat 
smaller variance, and, hence, a gain in precision for the same cost is 
to be expected. The survey should be carried out by that method. 

You will notice that to compare simple random sampling and 
cluster sampling in the above illustration, we borrowed the notion 
of efficiency from the last section; that is, we compared the vari¬ 
ances associated with both methods of sampling. Thus, efficiency 
is a criterion for deciding (a) between estimators given a method of 
sampling (as in Examples 7.10 and 7.11 of the last section), and 
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(6) between methods of sampling (as in Examples 7.13 and 7.14 

above). > , , . - 

Our short survey in this section indicates that the choice oi a 

method of sampling is important in the design of a statistical study 
because some methods may yield more reliable and, hence, less risky 
results than other methods. Our discussion of the choice of a method 
of sampling, although quite brief, has indicated the basic principles 
(cost versus precision) by which such choices are made in practice. 
Subsequently, however, we again shall center our studies on simple 
random sampling. 

7.4 Determining Sample Size 

Another integral part of a statistical decision rule in an estima¬ 
tion problem is the sample size. How does one decide how large a 
sample to select in such a problem? The answer to this question is: 
“On the basis of how much sampling risk it is economical to toler¬ 
ate.” Let’s discuss this statement further 

We already are well aware that the possibility of error in a sam¬ 
ple result always exists. Thus, there always is the possibility of in¬ 
correct decisions in estimation problems. What are these incorrect 
decisions? Generally speaking there are two types: overestimating 
and underestimating. Of course, some estimates are poorer than 
others. Still generally speaking, the larger the error in any estimate 
of a decision-parameter, the poorer is our choice—the more serious 
our incorrect decision. Thus, suppose that we estimate the inven¬ 
tory of a product at 50,000 units and estimate its value accordingly. 
This is an incorrect decision if 55,000 units is the actual inventory. 
Yet it is not as incorrect as if the estimate were 40,000. 

However, you also should recognize that in many problems some 
error or risk can be or should be tolerated. In such cases one need not 
aim at perfect precision—it is unnecessary. One should consider in¬ 
stead (1) how large an error in the estimate can be tolerated, and 
(2) how great a risk of exceeding that error should be taken. These 
two considerations are necessary before proceeding to solve a 
problem. They represent the way in which managerial objectives 
should be set in an estimation decision problem. As we shall soon 
see, once they are set, we may determine such things as the neces¬ 
sary sample size. Let’s consider an illustration of how a tolerable er¬ 
ror and the risk of exceeding it might be set in a given problem. 

Example 7.15 New York City receives per capita aid from New 

York State at the rate of $6.75 per person. The aid is based on the 
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50,000 people, this would mean that the city could be overnaid of.in 

«v«r neither the city nor the Mate would »t.” E 

Sf Z 

on the nature of the problem and , he ooneeqnences „f“ °n« 
decision m that problem. Thereto™, to set a tolerable e“or" 
sires*nf S ° UW l( etermiKe the consequences of errors of different 

Tte " C T“’ T “ pected from on error in 

mate may depend on subjective values. It may not be possible to 

measure it explicitly (as could be done in Example 7.15) P Thus an 
evaluation of consequences may depend to a great extent on the 
judgment of management. Nevertheless, to insure the correct state 

polio' otS “ " ,m “ mry f ““ “ the “"“Wcnce, of 

Thus, the necessary sample size depends upon a predetermined 

“s wU1 be by - J *• i-sss 

. r !i q e l l ired T Ple size also de V ends «P°n the estimator se 
lected and the method of sampling to be utilized. For example sun 

estimate We ^ " Simple rand ° m SEmple and estimate ** by the 

£ = X . 

We know that this estimator, x, is approximately normally dis 
«nb*d „th rts mean equal E (l) = „ „ d 



212 STATISTICS FOB BUSINESS DECISIONS [Oh. 7 

(assuming simple random sampling from an infinite population). 
Thus, its sampling distribution appears as ioiiows. 



A tolerable error and the risk of exceeding this error may be i 
terpreted easily in terms of this sampling distribution. As we 
know a*, represents a deviation of a sample mean from ,^-it rep¬ 
resents an “error” in a value of x. Hence, the tolerable error in an 
estimation problem may be equated to z<r e as follows. 

B — Z(Tx • 

Since a, = a/\/n with simple random sampling from an infinite 
population, we may write 

e = z ^k 

and solve this for n, the sample size, as follows: 

/- Z(T 

Vn = — 


2^2 


ZV 


n = 


This formula enables us to compute the required sample size m a 
decision problem in which we desire to estimate /*. 

What is the value of el This is the tolerable error which must be 
set in each decision problem and which depends on the nature o 

^ What is the value of zf First note that the risk of committing 
an error equal to or greater than e or graphically appears as the 
shaded area in the preceding chart. Of course this area may be 
larger or smaller, depending on the value of a-the normal devm • 
But the value of g is determined by the risk specified m an es 
mation problem. For example, if we were to set this risk as .05, z 
must be 1 1.96 because 5 per cent of the area under the normal curve 
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is outside of the interval the mean plus and minus 1.96 standard 
errors Other levels of risk which are used often and the correspond¬ 
ing values of 2 are as follows: 

Risk 

. 

•of. lit 

lit 

. t0 determine the necessary sample size in a problem 

involving the estimation of ^ by the formula n = zV/e 2 , we must 
know a, the population variance. This often is a problem for it 
requires a priori information about a population we are planning to 
study. Nevertheless, there are several methods which can be used 
to estimate the variance of a population a priori. Two may be 
noted here. First of all, a similar study may have been conducted 
recently and an approximation of a 2 may be derived from such 
data. Fortunately, the variability in many populations does not 
tend to change over time. For example, the sales of large retail 
stores tend to remain large, while those of small stores tend to re¬ 
main small. Therefore, even though the level of retail sales may 
c ange over time, the variability in sales remains approximately 
the same. Hence, data from last year’s study might be used to 
approximate <r for this year’s study. Secondly, estimates of may 
be based on a pilot study-a small study preliminary to the main 
sample study ; In any event, the validity of any assumption about 
the value of <r and hence, about the precision a sample yields may 
be checked after the sample is drawn. In the next section we shall 
see how this is accomplished. 

Thus, once the tolerable error (e) has been set, the risk of ex¬ 
ceeding this error determined and translated into a normal deviate 

( 2 ), and an approximation to a 2 derived, we may solve for the re¬ 
quired sample size. 

sn ^ m P le 7 - 16 In deciding upon a magazine in which to place 
an advertisement, it is desirable to know the average number of read- 
ers per copy. Assume that to obtain this information for a magazine 
a statistical survey, using simple random sampling, i s to be com 

avera^e^ “h ^ th ® tolerable error in the estimate of the 

taken in ^ bee \ as - 05 P ersons P er ™py and that the risk to be 
taken m having this error exceeded is .01. Further assume that the 

magazine m question has a circulation of approximately 1000 000 
copies and that the standard deviation of the population is esSS 
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to be 2 persons. The size of a simple random sample for such a study 
can be determined as follows: 


(2.58) 2 (2) 2 _ 26.6256 
” ~ (.051 2 .0025 

n = 10,650. 

You will note that although N is finite, equal approximately to 
1 000 000 we have used the formula to determine n for an infinite 
population. When N is large, as it is here, this formula yields approx¬ 
imately the correct answer. We shall check our results below by a 
more precise formula for finite N. 

The formula n = zV/e 2 applies only^ for problems involving 
the estimation of /* by the estimator /»= x on the basis of a simple 
random sample from an infinite population. Similar formulas can 
be developed for other situations, including other methods of sam- 
pling, in much the same way. 

If the population being sampled is finite, you can see that we 
would require a slightly smaller sample than in the infinite case m 
order to achieve the same precision. To derive a formula for deter¬ 
mining n in the case of a finite population, again we begin with 

e = zctx > 


but now we let 

a 

l N 

— n 


= 

JN 

- 1 

If we solve 


a 

N 

— n 


e = Z Vn' 

\ N 

- 1 


for n, we find 

zVY 

” " (N - lie 2 + zV ' 

Let’s apply this result to the problem presented in Example 7.16 
where N = 1,000,000, z = 2.58, a = 2, and e = .05. Thus, using the 
formula for n from a finite population, we find 


(2.58) 2 (2) 2 (1,000,000) 

" (999,999) (,05) 2 + (2.58) 2 (2) 2 
26,625,600 
" 2,526.6231 
= 10,538, 
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which is approximately equal to the answer (10,650) derived bv the 
simpler formula m Example 7.16. ^ 

Let us now consider some of the ways n and the factors deter- 

andThe'le ^ f P ° P “ latl0n f andard deviation, the tolerable error 

Ze ated ; The f0rmuIa for - indicates that 
e smaller the population standard deviation, the smaller the size 

? e re fh U1 ?1 sa “ pIe ‘ Jt also ^°ws that the smaller the risk of ex¬ 
ceeding the tolerable error, the larger the sample size. This results 
rom the fact that the smaller the risk, the larger will be the value of 
the normal deviate, & Finally, it is easy to see that the larger an er- 
or one is willing to tolerate, the smaller the required sample- and 
that the smaller the tolerable error, the larger the required sample 
size. Thus, for Example 7.16 we found a sample of 10,538 was re- 

leveT'of °oT “l* TV* ^ ^ per COp ^ an ”k 

At th / S same level of ri sk, however, a sample of only 
2 ,635 is necessary to achieve a tolerable error of .10 persons per conJ 
-by doubling a tolerable error we may reduce the required sample 

Hut 0 tToncfifto °V •! ° riginal * the tolerable error 

cut to one fifth of its original size, the required sample is in¬ 
creased twenty-five fold. These remarks illustrate the general prop 
osition that the sample size required in a study varies inversely with 

the square of the tolerable error. U 

Careful study should therefore be given to the size of the toler- 
able error tha is set for a study. Although precision is, of course 

,h “ is 8r " to th ” re » uired » 

You also should note that in setting the risk of exceeding the tol¬ 
erable error, we implicitly set risks for exceeding other errors as Wdl 
Thus, again refer to Example 7.16 and note that a sample of ap’ 
proxmiately 10,600 is required to keep the risk of exceeding the tol¬ 
erable error of .05 persons per copy at the .01 level. Of course the 
risk of exceeding an error of .04 persons per copy or .03 persons per 
copy or less is greater than .01, while the risk of exceeding an rrer 

01 In™i P f r or C ° Py ° r -° 7 P T M ^ C ° Py ° r more * ^an 
greater ^heTrisk ^of ^2^ 

Formulas for determining the required size of a simple random 
sample m problems involving the estimation of ^ or A or any other 

Smrio P n a of me fY“ ay b ®, derived in a W si ™lar to those for the 
estimation of (You might try to derive one or two in exactlv the 

same way we derived the formula for the 
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estimate /*.) Thus, to determine the sample size necessary for the 
estimation of tt we let 

e = zap 


with 


and find 


4- 


(1 - TO 
n 


__ ~ T ) . 


This assumes, of course, simple random sampling from an infinite 
population. For a finite population the formula for n is 

g 2 ]W(l — ir) __ 

n (N — l)e 2 + z 2 tT( 1 — ir) 

These formulas for n depend on the assumption that the normal 
curve adequately describes the sampling distribution under consid- 

To^ply either of these formulas for problems involving the 
estimation of *, we must know x—the very thing we wish to esti¬ 
mate! Again, however, note that a rough idea of the value of - may 
suffice to determine the necessary sample size. In contrast the 
purpose of the study itself is to estimate - more precisely and ob¬ 
jectively. Thus, the validity of any assumption about the value of 
w and, hence, the precision of a result may be checked on the basis 

of the sample. 

Example 7.17 Consider the problem of estimating the proportion 
of families in New York City who are listening to a given television 
program. Suppose that the tolerable error is set at 4 Percentage 
points and the risk of having that error exceeded is de¬ 

termine the size of a sample needed to obtain the specified degree of 
precision, we must assume some approximate value for * Thus, on 
the basis of past experience, we might assume *• approximately equals 
.20. N is rather large, and, hence, we may determine the sample size 
by the formula 


2 2 tt(1 

n —-y; 


r) 


Since the risk is .05, z equals 1.96. Furthermore, e has been set at .04 
and t r estimated as .20. Thus, 


n 


(1.96) 2 (.20)(.80) 
= .884 . 
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Our result indicates that a sample size of about 384—say 400 for con¬ 
venience—should yield an estimate of tt within ±.04 with only a 5 

per cent risk of exceeding that error. 

It also may be noted that if we lack any other way of approxi¬ 
mating the value of tt a priori, we may assume that the population 
is as variable as possible—that tt = .5. This makes tt (1 — tt) as 
large as possible and, hence, for a specified error and risk requires 
the largest sample size. Hence, this assumption guarantees that the 
sample size will be large enough to yield a desired tolerable error 
and risk, although it may make it far larger and more costly than 
necessary. 

The formulas for determining the sample size necessary to esti¬ 
mate decision-parameters require that one center on a tolerable er¬ 
ror and a risk of exceeding that error in the planning stage of a sta¬ 
tistical study. This is an objective approach to the problem of deter¬ 
mining sample size. Of course, after one sets an error and a risk and 
determines n, it may appear that the cost of the sample is prohibi¬ 
tive, or it may appear that the cost is only moderate and could be in¬ 
creased. Hence, in order to control directly the cost of a study, one 
common method is to allocate a fixed amount of money for a study 
and to select as large a sample as possible for that sum. However, it 
must be remembered that any specified sample size will involve 
some error and some risk. An arbitrarily set amount of money, then, 
may mean too much error and risk—it may be better to spend more 
or give up the study entirely. Alternately, it may mean too little er¬ 
ror and risk—less money could be spent. Isn't it better, then, to 
focus on error and risk first to see what sample size is necessary to 
achieve these goals? As we know, if the cost of attaining such an er¬ 
ror and risk are prohibitive, either the specified error or the risk 
might be adjusted. 

7.5 The Precision Attained 

After a sample has been selected, the pertinent observations 
made, and estimates of the decision-parameters computed, it is pos¬ 
sible to estimate the precision or reliability of the estimating proce¬ 
dure from the sample itself. That is, we can estimate, on the basis of 
the sample, the standard errors of the estimates prepared. 

For example, to estimate the average price of a product we would 
set up a statistical decision rule which would include the sample 
size necessary to achieve a specified tolerable error and risk of ex¬ 
ceeding that error. But the sample size, as we saw in the last section, 
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is determined on the basis of an assumption about the value of the 
population standard deviation (or the population proportion). 
However, this assumption may not be completely accurate. Hence, 
subsequent to the selection of the sample, an estimate of the stand¬ 
ard error of the estimated mean or proportion will indicate the ac¬ 
tual precision of our estimating procedure. 

The computation of the attained precision of an estimate is im¬ 
portant whenever the estimate is to be used by someone other than 
the persons conducting the study. The standard error of the esti¬ 
mate gives, in this case, an objective measure of the variability of 
the sampling procedure and, hence, of the reliability of the results. 
Thus, a jurist in evaluating sample results submitted in evidence 
may use the standard error to evaluate the procedure by which the 
results were obtained. As we already know, no one ordinarily can 
evaluate a single sample result in itself. 

The attained precision also is important to those conducting a 
sample study as a basis for their own decisions. Thus, if the stand¬ 
ard error of an estimate turns out too large and, hence, indicates 
that the estimates are not as precise as they should be, a supplemen¬ 
tary sample may be chosen. Estimates then would be based on the 

combined sample. . 

As we know, the variance of the sample mean, with simple ran¬ 
dom sampling, is 

* a 2 N — n 
"nN~ l' 

This measure of the precision of a sampling procedure may be esti¬ 
mated from the sample simply by estimating <r 2 . As was noted m 
Example 7.8, an unbiased estimator of the population variance is 


or, approximately, 


where s 2 is the sample variance. This formula may be used to esti- 
mate o- 2 on the basis of a sample and that value used, in turn, to 
estimate <*/ by the formula 

, , N — n 
ai ~ n N - 1 

This is an unbiased estimator of «•/. Let’s apply it. 
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Example 7.18 Reconsider the sample study planned in Example 
7.16 and assume that a simple random sample of 10,538 subscribers 
to the magazine has been selected is order to estimate the average 
number of readers per copy. The sample mean is 3.4 readers per copy 
which we may take as an estimate of jx. ’ 

In addition, we may estimate a- x as a measure of the precision of 
the sampling procedure. Thus, suppose that we find the variance of 
the sample to be 3.6. An estimate of the population variance on the 
basis of this sample statistic is as follows: 


n 


_ 10,538 
10,537 
= 3.6. 


X 3.6 


(As you can see, for a large sample size the estimated population 
variance is approximately equal to the sample variance.) For an 
estimate of <r/ we thus find (ignoring the f.p.c.) 


3.6 

10,538 
= .000342 . 

Hence, the estimated standard error is 

tz = V7000342 
= .018 , 

which is an estimate of the precision attained in the study. 

In Example 7.16 the tolerable error in the estimate of a was set as 
.05 readers per copy with a risk of exceeding this error planned as .01. 
lhe determination of the sample size necessary to meet these obiec- 
tives was based on the assumption that <r = 2. Now, from the sample 
we have estimated 


<f 2 = 3.6 

and, hence, 

<? = V333 
= 1.9. 

In view of the fact that this estimate approximately equals the value 
for a which we assumed to determine the necessary sample size it ap¬ 
pears that our assumption was reasonable and that our study 'ap¬ 
proximately achieved the planned precision. 

For problems in which *■ is estimated by the sample proportion of 
a simple random sample, the variance of the estimating procedure 
is, as we know, 


7 r(l — 7 r) N — n 
n N ~~T 


<r. 


2 _ 
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This may be estimated from a sample 4 simply by using p as an esti¬ 
mate of *•. Thus, 

. „ p(l - P) N - n 
*■* = N - 1 

Example 7.19 In Example 7.17 we found that we needed a simple 
random sample of about 400 families in order to estimate the propor¬ 
tion of families in New York City who were listening to a given tele¬ 
vision program with a tolerable error of 4 percentage points and a 
.05 risk of exceeding that error. Assume that this sample is drawn 
and that it is found that 30 per cent of the sample families were lis¬ 
tening. This 30 per cent provides an estimate of w, the population 

proportion. , , 

The attained precision of this study can be estimated by using 
p = .30 as an estimate of the population proportion. Thus, an esti- 
mate of <r p is (again ignoring the f.p.c.) 


(.30) (.70) 

- \ 400 

= VX100525 
= .023 . 

In contrast, we assumed that , = .2 in order to determine the re¬ 
quired sample size. If we were to measure the precision on the basis 
of our assumption, it would be _ 

I (.20) (.80) 

ffp ~ \ 400 

= v')000400 
= .020 . 

Hence, we estimate that the attained precision is slightly less than 
the planned precision because the value of <r p estimated from the 
sample is greater than the value of <r p that we assumed m determin¬ 
ing the required sample size. This further indicates that the actual 
risk of exceeding the tolerable error of 4 percentage points may be 
slightly higher than the planned risk of .05. 

7.6 Interval Estimation 

In the first section of this chapter we distinguish between a point 
estimate and an interval estimate. The latter, as you will recall, 
takes the form of a range of estimates about the value of a decision- 

* An unbiased estimator of <r,> would ordinarily include n -1 rather than « m 
the denominator. This is a small difference, and the use of n has become common m 
statistical practice. 
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parameter For example, we might estimate that the average a-e of 
* sroup of employees i» between 38 end 46 ye.m. This of 

values is an interval estimate. g 

Estimates in the form of intervals usually are called confidence 
intervals m the field of statistics. You see, the purpose of any inter¬ 
val estimate is to try to present a range of values which will include 
the true value of the decision-parameter under consideration. How¬ 
ever, since interval estimates are based on sample information we 
cannot expect them to be infallible. Nevertheless, a given procedure 

certain probability of including the true value of the decision-pa 
rameter. This probability may be taken as a measure of our con- 

Srs ie used Hen “’ ihe <- 

For example, the interval of 38 to 46 years, referred to above 
may or may not include the true mean age in the population. How-’ 
ver, suppose we knew that if the procedure used for constructing 
that interval estimate were to be used repeatedly, 90 per cent of the 
resulting intervals would include the true mean. This relative fre¬ 
quency or probability of .90 would measure our confidence in the 

E “■ “ *“ y be re,e ™ i ,o »* 4- 

How can we construct interval estimates? How can we measure 

^ at W<3 ™ ay haVe “ a given Procedure? To answer 
t ese questions let us see what would happen in a given situation 

i we repeated such an estimation procedure over and over again 
Suppose that we were studying the cold cereal consumption of 
all families m the United States and that we were interested in estF 
mating the average of this population (in terms of ounci per 

' , US ’ a sample of size 900 is to be selected and an interval 
estimate of /<• computed. The minimum value of the interval is to 
be computed by the formula 8 t0 

x - 1.96<r*, 

SE ™' Ue the iWerV * 1 “ to * by the 




Let us see what results the above procedure would yield if it 
were applied repeatedly to a given population. Assume Sen ha 

we know , = 400 end . = 120 lor the pop„„. ti „„. If 
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then the sampling distribution of the sample mean for samples of 
size 900 would be normally distributed, with E{x) - 400, and with 


a> " Vn 
_ 120 
“ V900 
= 4 

(ignoring the finite population correction factor which should be 
approximately equal to 1 in this problem). 

If a sample of 900 were drawn from such a population and it 
x = 405, we would construct the following interval. 


x - 1.96a-* = 405 - (1.96) (4) 
= 405 - 7.84 
= 397.16 , 


x + 1.96a, = 405 + (1.96) (4) 
= 405 + 7.84 
= 412.84 . 


This interval of 397.16 to 412.84 includes the population mean 
( M = 400) of our assumed population. However, if our sample mean 
had been 410, the interval would have been 


x - 1.96a, = 410 - (1.96) (4) 
= 402.16 , 


x + 1.96a* = 410 + (1.96) (4) 
= 417.84 . 


This interval would not include the true population mean. 

The diagram below illustrates a series of possible mtervals. Each 
dot represents a sample mean; each vertical line represents the 
range of the resulting interval estimate. 


420 h 


415 


410 


Q_ 

2 

«/> — 
z x 

8z 405 

_■ o 
< 2 

Soi 400 


So 395 


390 


385 


380 


H-400 


5 6 7 8 

SAMPLE NUMBER 


10 11 U 
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The horizontal line (at p = 400) indicates the population mean. 
Thus, of the 12 intervals diagrammed, we can see that only 2 fail 
to include the population mean; that is, only 2 do not intersect the 
horizontal line at p = 400. 

More fully, you can see from the following graph that any sam- 
pie mean that is at a distance of 1.96o-^ (7.84 ounces) or less from 
the population mean, in either direction, will lead to intervals that 
include the population mean. That is, only values of x between 
392.16 and 407.84 will result in an estimate that includes the popu¬ 



lation mean (p = 400). Any sample mean less than 392.16, or more 
than 407.84, will lead to intervals that do not include the popula¬ 
tion mean. 

Now, suppose that we were to set up interval estimates by re¬ 
peatedly drawing samples of 900 from our assumed population 
(with p — 400 and o- = 120). We can reason that 95 per cent of 
these intervals would include the population mean. This is so be¬ 
cause 95 per cent of the sample means will be included between 
the population mean plus and minus 1.96®* (between 392.16 and 
407.84). Only 5 per cent of the sample means will fall oufside of 
this interval. Hence, only 5 per cent will lead to interval esti¬ 
mates that do not include p = 400. 

The 95 per cent confidence interval is a consequence of our use of 
1.96®* in computing an estimate. The 1.96, you will recognize, is a 
normal deviate. In a similar manner, by choosing the proper normal 
deviate, we can set up intervals with any degree of confidence that 
we desire. Thus, the use of a normal deviate equal to 2.58 involves 
a risk of .01—and, hence, a confidence interval of .99. A .99 confi¬ 
dence interval is one derived by a procedure which if repeated in¬ 
definitely would result in only .01 of our interval estimates not in¬ 
cluding the population mean. 
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In an actual statistical problem of estimation, we do not know 
the population mean. Therefore, we have no way of knowing 
whether the one interval, x ± 1.96o*, that we set up includes or 
does not include the population mean. However, we do know that 
the interval which we set up was determined by a procedure which 
if repeated again and again would yield a set of intervals, .95 of 
which would contain the population mean. 

Another problem also presents itself. We ordinarily do not know 
o-, and, therefore, we cannot compute o^. However, we may estimate 
& and hence o* on the basis of the sample in the way considered in 
the last section. Thus, an interval estimate for is given by 

x — zdv and x + z&$ . 

The normal deviate (z) is governed by the choice of the confidence 
coefficient. 

Example 7.20 A simple random sample of 100 is drawn by a 
trade association from its membership of 500 firms in order to esti¬ 
mate the average hourly wage for milling machine operators. A .95 
confidence interval is to be calculated. It is found that the sample 
mean is $2.48 and the standard deviation of the population is esti¬ 
mated at $.50. The confidence interval is given by 

X — z&s x + Z&x 

with z — 1.96 because a .95 confidence interval is to be computed. 
The standard error of the mean may be estimated as follows: 

& \N - n 
** " VnNN - 1 

_ ,50_ koQ 

" V100 \ 499 
= .045 . 

Thus, the confidence interval is 

2.48 - (1.96) (.045) 2.48 + (1.96) (.045) 

2.48 - .09 2.48 + .09 

2.39 2.57 

We thus estimate that the population mean is included in the in¬ 
terval, $2.39-$2.57. Of course, we do not know whether this one 
particular interval contains the population mean, However, we can 
measure the reliability of the procedure used in computing this inter¬ 
val estimate. Its reliability is indicated by the confidence coefficient 
of .95. If many intervals, such as the $2.39 to $2.57, were computed 
by repeating our sampling procedure over and over again, about 95 
per cent of such intervals would include the true value of fx. 
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The theory of confidence intervals for other decision-parameters 
is quite similar to that sketched here. Thus, for an interval esti¬ 
mate of 7 r we simply compute 


V ~ z#p and p ~fi z& p , 

which give the lower and upper limits of the interval. 

Example 7,21 Suppose that the simple random sample of 100 
firms considered in the previous illustration also is used to estimate 
the proportion, tt, of firms working 3 shifts. Forty of the 100 sample 
firms operated 3 shifts at the time of the survey, so that p = .40. Sup¬ 
pose that we compute a .99 confidence interval estimate of tt. This 
confidence interval is given by 


V ~ p + z& p 

with z — 2.58. The formula for estimating o>, the standard error of a 
sample proportion, is 


f. = _ IpO- ~ p) In - n 

P \ n \JV - I 

-4 


(. 40) (.60) / 400 
100 \ 499 

= .044 . 

The confidence interval is 


.40 - (2.58) (.044) 

.40 - .11 

.29 


.40 ~f (2.58) (.044) 

.40 + .11 

.51 


TwmT 6 . es ^ ma ^ e population proportion is between .29 and 

.51. This is pile of many such intervals that could be derived by the 
same sampling procedure—approximately 99 per cent of such inter¬ 
vals would include the true population proportion. 


7.7 Estimation of Distributions 

In some statistical decision problems, as was pointed out in 
Chapter 3, a decision maker would like to know the whole frequency 
distribution of a population. In this section, we shall consider the 
use of the normal curve as a population model in connection with 
the estimation of population frequency distributions. Thus, we shall 
assume that the population of interest may be adequately described 
by the normal distribution. The normal curve then may be used to 

find the necessary frequencies by measuring areas under the normal 
curve. 

The estimation of a distribution by means of the normal curve is 
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a relatively simple procedure. A normal curve, as we know, is de¬ 
fined by only two parameters—its arithmetic mean O) and its 
standard deviation (A). One need not, in such a case, determine all 
of the many possible irregularities in the exact population fre¬ 
quency distribution. To derive estimates of and <x is sufficient. 

Example 7.22 A company which manufacturers automobile bat¬ 
teries considers several types of guarantees that it might offer to cus¬ 
tomers. One suggestion is to offer a complete refund on any batteries 
that fail within 1 year coupled with a partial refund on any that fail 
during the 13th to the 24th month. Another suggestion includes a 
double-your-money-back offer. All involve some set of offers scaled 
to the month in which the battery fails. 

The decision problem is to pick one guarantee. Officials of the 
company, however, are uncertain about the expected cost of each of 
their alternatives. In order to calculate the expected cost, per bat¬ 
tery, for each guarantee they must know the number of batteries 
which are expected to fail in the 23rd month from date of purchase, 
in the 24th month from date of purchase, etc. That is, the firm must 
know the frequency distribution of the times-to-failure of the bat¬ 
teries its manufacturing process can turn out. 

Suppose it can be assumed that this population of times-to-failure 
is normally distributed. In order to calculate an approximation to 
the frequency distribution based upon this assumption, only the 
mean and standard deviation of the population need be determined. 
Thus, suppose it were estimated that p = 24 and a = 2 months. 

The relative frequency of failures expected between 24 and 25 
months (the 25th month of use) can be determined by measuring the 
following area under the normal curve: 



In order to determine this area we compute the normal deviate 

X-n 


z = 


cr 

25 - 24 


- 0.5 
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? f f re Q aS 0 “ n ( der the normal curve indicates a tail area (past 
is 5 thu°s thS ° f 3085 f ° r * ~ °' 5 ' The central area ( 24 to 25 months) 


• 5UUU 


.3085 = .1915 


winch represents the relative frequency of failures expected between 

and 25 months. 

Similarly, we may determine that the relative frequency of failures 
expected between 24 and 26 months (the 25th and 26th months of 
use) is 3413 (the area under the normal curve from « to u + *). 
Thus, the relative frequency of those failing in the 26th month (be¬ 
tween 25 and 26 months) is 


.3413 (failing between 24 and 26 months) 
~~‘1915 (failing between 24 and 25 months) 
.1498 


Furthermore, to find the relative frequency of those expected to 

fai : ^noo month ( between 19 and 20 months), fhev would note* 
(a) .4938 is the area within the_ interval 19 to 24 months (p - 2.5o- 
to (o) .4772 is the area within the interval 20 to 24 months 

• moo t0fX jJJ nd ’ hence the area between 19 and 20 months 
is .4938 - .4772 = .0166. 


Similar calculations would lead the firm to the following relative 
frequency distribution (which you should verify): 


Expected Month 
of Failure 


17th, 
18th, 
19th, 
20th. 
21st. 
22nd 
23rd. 
24th. 
25th. 
26th. 
27th. 
28th. 
29th. 
30th. 
31st. 
32nd. 


Months of 
Use 


16 and under 17 

17 

it 

u 

18 

18 

a 

u 

19 

19 

a 

<< 

20 

20 

a 

tt 

21 

21 

a 

tt 

22 

22 

a 

tt 

23 

23 

a 

it 

24 

24 

tt 

tt 

25 

25 

tt 

tt 

26 

26 

u 

tt 

27 

27 

tt 

tt 

28 

28 

a 

tt 

29 

29 

u 

tt 

30 

30 

a 

tt 

31 

31 

a 

tt 

32 


Relative 

Frequency 


.0002 

.0011 

.0049 

.0166 

.0440 

.0919 

.1498 

.1915 

.1915 

.1498 

.0919 

.0440 

.0166 

.0049 

.0011 

.0002 


1.0000 


In applying the normal population model, or any other model, 
m the way outlined, it must be remembered that unless the model 
used is a good replica of the population, the estimated frequency 
distribution it provides will be considerably biased. Thus, if one is 
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thought necessary, a model must be selected carefully on the basis 
of the combined judgments and experiences of management and 
the statistician. There also exist certain statistical methods for de¬ 
termining the adequacy of models, but they are beyond the scope 
of this text. 

7.8 You Should Now Know That 

A statistical estimation problem involves selecting, on the basis 
of sample information, an estimate which approximates the value 

of a decision-parameter. . , , 

A point estimate takes the form of a single value; an interval 

estimate the form of a range of values. 

A decision rule in an estimation problem must specify a method 
of sample selection, the size of the sample, and an estimator. 

An estimator is a formula which translates sample observations 
into an estimate of a decision-parameter. 

Among the statistical criteria for evaluating estimators are con¬ 
sistency, unbiasedness, and efficiency. , , 

The choice of a method of sampling depends on both the cost 

and the precision of available methods. . 

One method of determining sample size in an estimation problem 
is to specify a tolerable error in an estimate and the risk of exceed¬ 
ing that error, and then solve for the required n by the appropriate 

After the sample has been drawn and an estimate calculated it 
is desirable also to estimate the standard error of that estimate. This 
is an indicator of the attained precision. ... , 

A confidence interval is a range of values which, it is hoped, 
includes the true value of a decision-parameter. Any one interval 
estimate may or may not include the true value. However, the con¬ 
fidence coefficient, e.g., 95 per cent or 99 per cent, measures the 
probability that the procedure used will result m an interval actually 
including the true value. 

When it is appropriate, the normal population model may be 
used in the estimation of a population frequency distribution. This 
requires only the estimation of ** and a- and the computation of fre¬ 
quencies by measuring areas under the normal curve. 

7.9 You Should Now Be Able to Solve 

1. A statistics instructor conducts a sampling experiment in 
class. A simple random sample of 5 students is drawn from the class 
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of 40. The average weight of the students in the sample is 160.2 
pounds. This is an estimate of the population (class) average. The 
instructor also requests one student to select a judgment sample of 
5 students. The result is a sample mean of 153.8 pounds which is an¬ 
other estimate of the population (class) average. 

Subsequently, a complete survey of the class shows the true value 
of the population mean to be 155.1 pounds. Should the instructor 
and the class conclude that the mean of a judgment sample is a bet¬ 
ter estimator than the mean of a simple random sample for estimat¬ 
ing the average weights of students in statistics classes? Why or 
why not? J 

2 . Which method of sample selection (simple random, stratified, 
or cluster) would you recommend in the following situations? Ex- 
plain your answers. 

a) A study to indicate the attitude of factory workers toward the 
establishment of a company cafeteria. 

b ) A study to estimate the number of vacant dwelling units in a city 
whose population is over 1 million. 

c) A study to estimate the proportion of defective toothbrush handles 
produced by a given manufacturing process. 

d) A study to estimate the inventory of a department store 

e) A study to estimate the proportion of high school seniors in a given 
state who plan to enter college. 

3. You are to conduct an opinion poll to determine the opinions 
of residents of a given community about a projected industrial de¬ 
velopment program. How large a sample should you select to esti¬ 
mate the proportion of adult residents favoring the projected de¬ 
velopment? Make all assumptions necessary to determine the sam- 
pie size, and justify these assumptions. 

4. An aircraft parts manufacturer wishes to determine the aver¬ 
age shearing strength of a certain type of weld in order to submit a 
bid for a contract to produce these parts. What size sample is re¬ 
quired so that the risk of exceeding an error of 20 pounds or more is 
.005? Assume that <r is 100 pounds. 

5. The market research department of an electric appliance man¬ 
ufacturing concern has been allocated $500 to conduct a survey in a 
given state to determine the proportion of retailers who prefer a 
new design for a kitchen clock to the current design. The firm is to 
use this information as a guide in deciding whether or not to market 
the clock. For the estimate to be of use it is desirable that the error 
should not exceed 5 percentage points. 
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For the amount budgeted, can the market research department 
provide the firm with an estimate of the required precision? Ex¬ 
plain your answer. (Assume that the cost of an interview is $1.25, 
that tt is about .50, and that simple random sampling is to be em¬ 
ployed.) . 

If you were the department head, what suggestions would you 

make to management? 

6 . A ratio-delay study is conducted to determine the proportion 
of time a key-punch operator spends in unproductive work. A ratio- 
delay study is conducted by making observations at instants of 
time that are selected at random. At each instance the key-punch 
operator will be or will not be engaged in productive activity. 

a) How large a sample is needed to determine the proportion of un¬ 
productive time so that an error of 3 percentage points is not ex¬ 
ceeded more than 1 time in a hundred? (Assume tt is .10.) 

b) How large a sample would be needed so that the risk of exceeding 
an error of 6 percentage points or more is .01? 

c) If we assume tt is .20, would a larger or smaller sample be re¬ 
quired? Explain. . 

d) If we are willing to risk an error of 3 percentage points or more 
5 times in a hundred, will the sample size be smaller or larger than 
in (a) ? Explain. 


T. A sample of 500 accounts receivable is selected from the 4,032 
accounts that a firm has, and the sample mean is found to be 
$242.30. The sample standard deviation is computed to be $3.20. 
Set up a .99 confidence interval estimate of the population mean. 
How do you interpret the meaning of this interval? 

8. A survey on consumer finances reports that 33 per cent of 
a sample of 2,600 spending units expected good times during the 
next 12 months. Assume that a simple random sample was used 
in the study. Set up a .95 confidence interval estimate of the popu¬ 
lation proportion of spending units expecting good times. 


7.10 You Will Also Find That 

For further discussions of the topics covered in this chapter and 
for some supplementary ideas on estimation, the following are key 
references: 

Hirsch, Werner Z. Introduction to Modern Statistics, pp. 125-37, 154- 
58. New York: Macmillan Co., 1957. 

Rosander, A. C. Elementary Principles of Statistics, pp. 229-309. New 
York: D. Van Nostrand & Co., Inc., 1951. 
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Wallis, W. Allen, and Roberts, Harry V. Statistics: A New Ap¬ 
proach, pp. 443-70. Glencoe, Ill.: The Free Press, 1956. 

Neter, John, and Wasserman, William. Fundamental Statistics for 
Business and Economics, pp. 307-47. New York: Allyn & Bacon, Inc., 



CHAPTER ■ B 


Risk and Uncertainty in 
Testing Problems 


8.1 Risks in Testing Problems 

In the preceding chapter we considered the problem of formulat¬ 
ing decision rules for estimation problems. In this chapter we shall 
develop methods for formulating decision rules for testing prob¬ 
lems. A testing problem, as you should remember, is one in which a 
decision maker must choose between two or more qualitative 
courses of action. A testing problem can be treated statistically if, 
and only if, the choice of an alternative course of action depends 
exclusively on knowledge of the value of some decision-parame¬ 
ter (s). For simplicity in our introductory survey of statistics we 
shall concentrate on two-action testing problems—that is, on those 
problems in which the decision maker has only two alternatives to 
choose from. For example, accepting or rejecting a lot of raw mate¬ 
rials, marketing or not marketing a new product, and hiring or not 
hiring an applicant for an executive training program are all illus¬ 
trations of two-action testing problems. Remember, however, that 
multiaction testing problems often arise in the business world. 

One basic question to be answered in this section is: “What do we 
mean by risks in a testing problem?” We have already answered this 
question here and there throughout the first seven chapters. But be¬ 
cause this material is here and there and not centralized in any one 
section, we had better co-ordinate it right now. To review certain 
ideas about risks in testing problems, let us consider the case of an 
advertising manager who must decide whether or not to place a par- 
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ticular advertisement in a local newspaper. Suppose that he were to 
flip a coin to reach his decision. If a head appeared, he would place 
the advertisement, and if a tail appeared, he would not. Then, if an 
advertisement were a potentially effective one (and, therefore, 
should be inserted), the advertising man would be running a 50 per 
cent risk of making the wrong decision (not inserting the advertise¬ 
ment). Furthermore, if the advertisement were a potentially inef¬ 
fective one, the coin-flipping decision procedure would result in a 
50 per cent risk of making the wrong decision (inserting an ineffec¬ 
tive advertisement). 

In short, this procedure and the probabilities associated with it 
might be summarized as follows: 


Alternative 

State of the World 

Effective 

Ineffective 

Place the ad.... 

! Correct decision 

[ Incorrect decision 


.50 

.50 

Do not place the ad.. . 

Incorrect decision 

Correct decision 


.50 

.50 


Notice that all the probabilities are conditional probabilities—they 
depend upon the state of the world. For example, the first .50 tells 
us that if the advertisement is a potentially effective one, the coin¬ 
flipping procedure would give us a .50 chance of making the correct 
decision—placing the advertisement. The other probabilities are to 
be interpreted similarly. 

Of course, flipping a coin to make advertising decisions is a naive 
procedure. The risks entailed are great. A company in the long run 
would have a small chance of survival if half of the potentially in¬ 
effective advertisements were placed and half of the potentially ef¬ 
fective advertisements were not used. In fact, no advertising direc¬ 
tor is needed for such a decision procedure. His training and experi¬ 
ence are not drawn upon. 

By contrast, an advertising manager, before making a decision, 
might desire to select a sample to determine the proportion of peo¬ 
ple who see and remember the advertisement. This procedure re¬ 
duces his uncertainty and the risks of making errors in the classifica¬ 
tion of advertisements. But to go to another extreme, note that 
every advertising director would like to be able to boast about the 
following probabilities: 
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Alternative 

State of the World 

Effective 

Ineffective 

"PI a op the ad. 

Correct decision 

1 

Incorrect decision 

0 

Incorrect decision 

0 

Correct decision 

1 

Do not place the ad. . . 


This situation means that a correct decision would always be made. 
While this is a highly desirable situation, it is ordinarily impossible 
to achieve when decisions are made in the face of uncertainty. The 
only way in which the incorrect decision of neglecting to place po¬ 
tentially effective advertisements can be avoided is to place every 
advertisement. But, this would mean that we could never avoid the 
error of placing a potentially ineffective advertisement. Similarly, 
the only way we could avoid the placing of an ineffective advertise¬ 
ment would be not to place any advertisement. This, however, 
would never give an effective advertisement the possibility of being 
inserted. 

Although, it is impossible to eliminate either type of incorrect 
decision, statistical theory enables us to control the relative frequen¬ 
cies of these errors. The level at which these risks are to be con¬ 
trolled will depend upon the consequences of an incorrect decision 
and the costs of sampling. Thus, the consequences of placing an in¬ 
effective advertisement involve the needless expenditure of funds 
to insert the advertisement and perhaps a loss of sales. The conse¬ 
quences of neglecting to place an effective advertisement involve 
the expense of preparing the advertisement and the potential loss 
of sales because the advertisement was not inserted. The risks de¬ 
cided upon should be those that result in minimum economic losses 
when the costs of sampling and the consequences of both types of 
errors are taken into consideration. 

After such an analysis of risks and costs, the advertising director 
might decide that he is willing to assume the following risks: 


Alternative 

State of the World 

Effective 

Ineffective 

Place the ad. 

Correct decision 
.95 

Incorrect decision 
.05 

______ 

Incorrect decision 

.10 

Correct decision 
.90 

Do not place the ad.. . 
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Thus, the advertising manager would be satisfied with a decision 
procedure for which the probability of neglecting to place an effec¬ 
tive advertisement is .05 and the probability of placing an ineffec¬ 
tive advertisement is .10. 

These two types of incorrect decisions have special statistical 
names. They are called errors of the first kind and errors of the sec¬ 
ond kind . They also are referred to as Type I errors and Type II 
errors, respectively. By convention, we designate as the error of the 
first kind (Type I) the error that we are most eager to avoid. When 
we are just as eager to avoid both types of error, generally either 
error may be referred to as the Type I error. In addition, the prob¬ 
ability of incurring a Type I error is termed the a-risk (alpha risk), 
and the probability of incurring a Type II error, the /3-risk (beta 
risk). Thus, the error of neglecting to place an effective advertise¬ 
ment is a Type I error since this is the risk the advertising manager 
is more eager to avoid and the probability of the occurrence of such 
an error, the a-risk, is .05. The error of placing an ineffective adver¬ 
tisement is the Type II error, and the probability of its occurrence— 
the £-risk is .10. Notice that a Type I error can occur only if the 
advertisement is effective and a Type II error can occur only if the 
advertisement is ineffective. Therefore, the probability that an er¬ 
ror will occur is always conditional upon the existence of a particu¬ 
lar state of the world. 

The advertisement classification problem as we have formulated 
it cannot be treated statistically. To do so, it is necessary to de¬ 
scribe the states of the world in terms of a decision-parameter. Sup¬ 
pose that the advertising manager selects as the decision-parameter 
the proportion of readers who see and remember the advertise¬ 
ment. The states of the world, therefore, could be described by 
values of tt ranging from .00 (none remember) to 1.00 (all remem¬ 
ber). However, the statistical technique that we shall employ to 
develop decision procedures requires that we focus our attention on 
only two of the many possible descriptions of the state of the world 

i.e., two values of ? r. Thus, the advertising manager might select 
* = .25 as the point at which he would like to be almost sure of us¬ 
ing the advertisement. In other words, tt = .25 describes a state of 
the world which is associated with a potentially effective advertise¬ 
ment. To describe a state of the world associated with a potentially 
ineffective advertisement the manager might select tt == .10—a point 
at which he would hesitate to place the advertisement. We can re¬ 
state the problem as follows: 
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Alternative 

State of the World 

7T = .25 

7r = .10 

Place the ad. 

Correct decision 
.95 

Incorrect decision 
.05 

Incorrect decision 
.10 

Correct decision 
.90 

Do not place the ad.. . 


The advertising manager thus indicates that he would like to in- 
sert an advertisement which 25 per cent of the readers will see and 
remember. However, after considering the costs of sampling and the 
costs of incorrect action, he is willing to run the risk of not placing 
such advertisements only 5 times in 100. On the other hand, he 
would prefer not to place advertisements if only 10 per cent of the 
readers will remember them. But after his consideration of the costs 
of sampling and of incorrect action, he is willing to run the risk o 
placing such advertisements 10 times in 100. _ 

The aspects of testing problems that we have been considering 
the specification of alternative courses of action, the selection of a 
decision-parameter, the particular values of the decision-parameter 
upon which to focus, and the specification of the a- and /J-risks—are 
primarily responsibilities of management. However, the statistician 
usually must take an active part in their selection. He must provide 
estimates of sampling costs and assist in the determination of t e 
consequences of both types of error. It then becomes the job of the 
statistician to formulate a decision rule, the application of which 
will control risks of incorrect action at the specified probabilities. In 
the next section we consider the development of such decision pro- 
cedures. 

8.2 Formulating the Decision Rule—the Proportion as a Deci¬ 
sion-Parameter 

We have seen that one way in which management sets its re- 
quirements for a decision rule is to demand that the rule.offer a 
requisite degree of protection against making incorrect decisions. In 
statistical terms, management is looking for a decision rule w ose 
application will offer assurance that: (a) the probability of a Type 1 
error will be a, and (b) the probability of a Type II error will be £. 
In this section we shall consider a method for determining such a 
rule for problems in which the population proportion is the decision- 
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parameter; that is, for problems in which v adequately describes 
states of the world. Let’s consider the following example to explain 
the method. 

Example 8.1 A marketing decision must be made on whether or 
not to market a new scouring powder. A sample of consumers is to be 
taken and the decision between marketing the product or not market¬ 
ing it is to be based on this sample. 

Management and the statistician decide to concentrate on the 
proportion of family units who say that they prefer this scouring 
powder after using a free sample for a week—this proportion for all 
family units is the decision-parameter. They further focus atten¬ 
tion on two values of ?r : tt 0 = .25, at which point it would be prefer¬ 
able to market the new product; and ^ = .40, at which point it 
would be preferable to market. 

In summary, the decision problem is as follows: 


Alternative 

State of the World 

7T o = .25 

7Ti = .40 

Market. ! 

Incorrect decision 
Correct decision J 

Correct decision 
Incorrect decision 

Don’t market. 



. _ 0ne other Preliminary consideration involves a statement of the 
risks that management is willing to take. Suppose that they are will¬ 
ing to take only a .10 chance (the 0-risk) of not marketing the prod- 
uct if 7T - .40. They also are willing to take only a .05 chance (the 
a-risk) of marketing the product if tt = .25. 

The Type I error in this problem is the marketing of the scouring 
powder if*- = .25 since this is the error management is more eager 
to avoid. The probability of incurring this error is .05 as against the 
probability of .10 of incurring the Type II error—not marketing the 
product if 7r= .40. 

Before considering the determination of the decision rule we 
shall introduce some statistical terminology that often is used in 
discussions of testing problems. This terminology, while not essen¬ 
tial to an explanation of the concepts involved in testing problems 
is widely used in the field. 

The two values of tt upon which we center our attention repre¬ 
sent assumptions that we make concerning the state of the world. 
Statisticians call such assumptions hypotheses. In the problem we 
are considering, one hypothesis is that *- = .40; the other is that 
* ~ ^ we mar ket the new scouring powder, the hypothesis 

w * s acce Pted and the hypothesis w = .25 is rejected. On the 
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other hand, if we do not market the scouring powder, the hypothesis 
7 T = .40 is rejected and the hypothesis = .25 is accepted. Thus, it 
is possible to describe the action taken in terms of only one of the 
hypotheses since the acceptance of one means the rejection of the 
other, and vice versa. The hypothesis upon which we center our at¬ 
tention is called the null hypothesis and is designated as H 0 the 
other hypothesis is known as the alternate hypothesis and is desig¬ 
nated as iJi. 

It is customary to designate as the null hypothesis that hypothe¬ 
sis which, if rejected when true, leads to a Type I error. Thus, in the 
problem under consideration, tt 0 = .25 is the null hypothesis. Why? 
If t. 25 and the scouring powder is marketed, a Type I error will 
result. Obviously, = .40 is the alternate hypothesis. The prob¬ 
ability of a Type I error (a) is thus the probability of rejecting the 
null hypothesis if it is true. The /3-risk represents the probability of 
accepting the null hypothesis if the alternate hypothesis is true. 
Problems of the type that we are considering, therefore, can be de¬ 
scribed as tests of hypotheses—to accept the null hypothesis is 
to take one action, and to reject the null hypothesis is to take the 

other. 

We return now to the formulation of a decision rule for the mar¬ 
keting problem under consideration. Generally, the rule will take 
the following form: Select a simple random sample of size n and ob¬ 
serve the sample proportion, p. If p is less than or equal to some 
critical value (which we shall denote by p*), do not market the prod¬ 
uct; if p is greater than this critical value, market the product. Ob¬ 
serve carefully the essential elements of a decision rule for a testing 
problem: 

1. A method of sample selection (e.g., simple random sampling), 

2. The sample size (n), and 

3. The critical value of the statistic (e.g., p*) which governs the ac¬ 
tion to be taken for various values of the statistic. 

Since simple random sampling is assumed, our task is to find values 
of n and p* for the decision problem under consideration. 

This decision rule must offer a degree of protection against the 
occurrence of two types of error. We may now consider these errors 
one at a time. A Type I error can occur only if * == .25 (if the null 
hypothesis is true). However, we know that the normal curve will 
approximate the sampling distribution of p, and that if * =: -25, its 
mean, E(p), also is .25. In this sampling distribution we need to 
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find a critical value, p*, so that .05 of the area of the normal curve 
IS to the right of p* as in the diagram below: 



Why .05? If y = .25, management is willing to assume only this de¬ 
gree of risk in marketing the product. Since the product will be 
marketed when we observe a value of p greater than p*, only 5 
per cent of all possible sample proportions should exceed p* if 

The table of areas under the normal curve indicates that we must 
go out 1.640-j, to reach a point in the tail beyond which .05 of the 


determined risk of .05 is to be met. Thus 


p* = .25 + 1.64 < 

But, 

_ L(l - X ) 

\ n 

Hence, 

V * = .25 + 1.64-v 


- ?r) 


.25 + 1 




(.25) (.75) 


n 


= .25 + 


■ 710 _ 

V n 


This is the relationship that p* and n must satisfy to meet the con- 
dition that the probability of a Type I error is .05. 

^Te now consider the Type II error. This error can occur only if 
~ the alternate hypothesis is true). Again we know 

that the sampling distribution of p will be approximated by the 
normal curve, and if - = .40, E(p) a l so is .40. Tor this distribution 
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we want to find another equation relating p* and n so that .10 of 
the area is to the left of p*, as in the diagram below: 



VALUE OF p 

Why .10? If 7 r — .40, management is willing to take a risk oi .10 of 
not marketing the product. The product will not be marketed if p 
is equal to or less than p*. Therefore, only 10 per cent of the possi¬ 
ble sample proportions should have values equal to or less than p* 
if*r=.40. 

The table of areas under the normal curve indicates that it we go 
out 1 ,28<r p from E(p), the area in the tail is .10. This means that 

p* = .40 - 1.28<r„_ 

= ■40 - 1 . 28 ^^ 

= .40 — U28yJ—-~— 

This is the relationship which p* and n must satisfy to meet the con¬ 
dition that the probability of a Type II error is .10. 

Earlier we also found that 


p* = .25 + 


.710 

Vn 


The values of p* and n which will satisfy both of these equations 
will be the basis for our decision rule. We can solve for n by equating 
both expressions for p* as follows: 

.710 


.40 - 


.626 
V n 


.15 - 


= .25 + 7 

Vn 

710 + .626 


Vn 


1.33 6 
.15 
= 8.91 

n = 79.3, or80. 
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Thus, the required sample size is 80. We can determine the value of 
for p y * SUbStitUting the ValUC ° f n “ either ° f the tW ° ex P ressions 


.40 - ^ 

Vn 

.626 
8.91 
.07 


- .40 

= .40 
= .33 


V * = .25 + ' 


.710 


v 7 


n 


= ,25 + 


.710 

8.91 


= .25 + .08 
= .33 


We now have determined the decision rule. It is: Select a simple 
random sample of 80 families and determine the proportion of fam- 
i ies m the sample that prefer the scouring powder. If 33 per cent or 
less favor the scouring powder, do not market it. If more than 33 per 
cent prefer the scouring powder, market it. This rule meets the risks 
that management and the statistician agreed upon. 

. In general > the procedure we have followed is: We focus atten¬ 
tion on two states of the world, «■„ (the null hypothesis) and ^ (the 
alternate hypothesis). We determine the risks we are willing to take 
m the decision problem. The general equations for determining the 
values of n and p* are then 


V* - /s>£Lr_*il 

\ n 

V * = *1 - . 

\ n 

Za and z fi are the appropriate number of standard deviations—the 
appropriate normal deviates—required to yield the predetermined 
a- and /J-risks. These two equations are solved to determine the 
sample size (n) and the critical value of p (p*). 

8.3 Formulating a Decision Rule—the Mean as a Decision Pa- 
rameter 


The previous section dealt with situations in which the popula¬ 
tion proportion is used to describe states of the world. We now shall 
consider the formulation of a decision rule for a problem in which 
the population mean is the decision-parameter. We shall relate our 
discussion to the following problem situation. 


Example 8.2 A firm packages instant coffee in small-sized iars— 
net weight 2 ounces. It is interested in controlling its production 
process; in making sure that each of he jars contains about 2 ounces. 
The company has studied its process when it was operating satis- 
c on y. t has estimated that the average contents per jar is 2.06 
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ounces, and the standard deviation of the process is .02 ounces. The 
company is satisfied with this performance and wants to maintain it 

in future production. „ , . , AO 

We shall assume throughout our discussion that o- remains at .uJ 
ounces, and develop a decision rule to control variations in (To 
control variations in cr we would need a second decision rule.) Thus, 
assume that the company plans to take a sample of n jars every hour. 
On the basis of the average weight of these n jars it will decide 
whether to leave the process alone or to stop the process m order to 
try to correct whatever is wrong. 

In addition to the use of the sample mean as a basis for decision, 
this situation differs in another important way from the one we con¬ 
sidered in the preceding section. In this problem the process is op¬ 
erating unsatisfactorily if the jars are either overfilled or underfilled. 
Therefore, a decision rule must divide all possible values of the sam¬ 
ple mean into three regions, as illustrated in the following diagram: 


Stop 

Process 

Leave 

Process 

Alone 

Stop 

Process 

X 

Values of x 

:* 


where x * represents the lower critical value of the sample mean 
and x* the upper critical value of the sample mean. 

The diagram indicates that very high values (presumably over¬ 
filling) and very low values (presumably underfilling) of the statis¬ 
tic lead to the same action, while middle values lead to the alter¬ 
nate action. Since both extremes—high and low values—lead to the 
same course of action, we have what is called a two-tailed testing 

problem. . . _ _ - , , , 

In contrast, let us reconsider the decision rule developed to de¬ 
cide whether or not to market a new scouring powder. That rule di¬ 
vides possible values of the statistic, p, into two regions: values 
equal to or less than p*, and values greater than p*. Any value 
within one of these regions leads to a unique action, as shown in the 


diagram below: 


Do Not Market 

Market 

r 



p* 


Values of p 

Since only the higher or the lower values are involved in a decision, 
we have what is called a one-tailed testing problem. 

When should a decision rule be one-tailed and when should a de- 
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cision rule be two-tailed? The answer is relatively simple. We ask 
ourselves: “Suppose that we knew the decision-parameter in the 
problem under consideration; would we take action A for high and 
low values of this parameter or would we take that action only for 
high or low values?” If the answer is high and low, we have a two- 

tailed testing problem, otherwise we have a one-tailed testing prob- 
lem. 

To return to the coffee jar problem, let us consider the general 
form of the statistical decision rule we must formulate. This form 
will be as follows: Select a simple random sample of n jars and de¬ 
termine the average weight of their contents. If this average weight 
is between S* and x*, let the process continue to run. If the average 
is equal to or less than J*, or if it is equal to or greater than x * stop 
the process. ’ 

To determine the values of n, x„ and x* for this rule it is neces¬ 
sary for management first to focus attention upon values of the 
population mean to describe states of the world. The company, as 
noted earlier, wishes to leave the process alone if p = 2.06, the nor¬ 
mal process average, and selects this as one value of p on’which to 
focus attention. The company would want to stop the process if the 
value of P - departed from 2.06 by an amount which would result in 
excessive overfilling or underfilling. It thus chooses p = 2.03 and 
P = 2.09 as the values of the decision-parameter which describe the 
alternative states of the world. The 2.03 represents a state in which 
the process is underfilling and, hence, a state for which the company 
would like to stop the process. The 2.09 represents a state in which 
the process is overfilling and, hence, also a condition which the com¬ 
pany would like to try to remedy by stopping and checking the 
process. Notice that p = 2.03 and p = 2.09 are both .03 ounces away 
from the value of p = 2.06 which describes the first state of the 
world. This symmetrical formulation simplifies the determination 
of a decision rule. 

Our decision problem can be summarized, and risks assigned, as 
follows: ’ 


Alternative 

1 State of the World 

yUO = 2.06 

m = 2.03 

ju 2 = 2.09 

Leave process alone. 

Stop process. 

Correct decision 
.99 

Incorrect decision 
.01 

Incorrect decision 
.05 

Correct decision 
.95 

Incorrect decision 
.05 

Correct decision 
.95 
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This indicates that the company is willing to take only a .01 risk of 
stopping the process if /*■ = 2.06 (if H 0 is true). Since the company 
is more eager to avoid the error of stopping the process if /* = 2.06, 
this may be regarded as the Type I error. Hence, the .01 is the 
a-risk. Our summary further indicates that the company is willing to 
take only a .05 risk of leaving the process alone if /* = 2.03; that is, 
if there is excessive underfilling. It also is willing to take a .05 risk 
of leaving the process alone if fi = 2.09; that is, if there is excessive 
overfilling. Note that the alternate hypothesis, H t) refers to the pos¬ 
sibility that /* equals either 2.03 or 2.09. Hence, there are two 
/2-risks—one if /* = 2.03, the other if /* = 2.09. 

Let us now determine the values of n, x*, and x* for the required 
decision rule. This rule must assure that the risks of incorrect de¬ 
cision are as specified above. We shall consider these risks one at a 
time. 

We first consider the a-risk, the probability of leaving the process 
alone if j* = 2.06. This has been set as .01. Note that the sampling 
distribution of the sample mean is approximated by the normal 
curve, and if /* = 2.06, its mean, E(x), also is 2.06. For this sam¬ 
pling distribution, we want to find the critical values, x* and x*, so 
that .01 of the area of the normal curve falls beyond these values 
as in the diagram below: 



The area has been divided equally between the two tails because of 
the symmetry of the problem. If /* = 2.06, the error that may occur 
is to stop the process. The process will be stopped only if the value 
of a sample mean does not fall between x* and $*. Since only .005 
of all possible sample means will be equal to or more than x , and 
only .005 will be equal to or less than x*, the probability of stopping 
the process, if /* = 2.06, will be .01—the risk set by the company. 

The table of areas under the normal curve indicates that since 
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the area in the tail is .005, the distance from E(x) to either of the 
critical x values is 2.58^. Thus, 

z* = 2.06 - 2.58(7* and x* = 2.06 + 2 . 58 ( 7 * 


Hence, 


** = 2.06 - 2.58 ^ x* = 2.06 + 2.58^- 

2 , ST,«Ce POi ‘ ,,8d ° Ut “ E " mp,e 8 ' 2 ' ' has 


x* = 2.06 - (2.58) 


x* = 2.06 + (2.58)^^ 


These are the relationships which n, x*, and x* must satisfy to meet 
2 Oe^kOl " ^ the Pr ° babillty ° f st °PP in g the process if „ = 

We now direct our attention to the second risk-the probability 

— 9 6 process alone ^ — 2.03 (excessive underfilling) If 

tbe Sam P lln S distribution of the mean will be approxi¬ 
mated by the normal curve with E(x) = 2.03. In this sampling dis¬ 
tribution on y Oo of the possible sample means must be greater 
than x* } as illustrated m the diagram below: 



2-03 

VALUE OF x 


Why .05. The company is willing to assume only this risk of leav- 
mg the process alone if „ = 2.03. Since the process will be left alone 
when we observe a value of x between x * and x*, only 5 per cent 
of ah possible means should exceed x*. (If p = 2.03, practically no 

W f 8 u, eater than Therefore - we can say that 
the proportion of possible sample means between x* and x* is the 
same as the proportion greater than .f*.) 
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Since the area beyond x* is .05, it is 1.64^ from E(x). 


Therefore, 


— 2.03 -\- 1.64a-x • 


But, 


<Jx = 


.02 


Hence, 


x# = 2.03 + 


.0328 

V n 


This is the relationship n and S* must satisfy to fulfill thej;ondi 
tion that the probability of leaving the process alone, it m - 2UA, 

We now consider the error of leaving the process alone it — 

2,09 _i. e . if there is excessive overfilling. The sampling distribution 

of the mean is again approximated by the normal curve, and 
E(x) = 2.09. In this sampling distribution, only .05 of the possible 
carrmle means must be less than x*, as shown in the diagram below: 



The risk of leaving the process alone if ,*- 2.09 is set at b5 
only .05 of all possible sample means should be less than x . ( 
a = 2.09, practically no sample means will be less than x*. there¬ 
fore we can say that the proportion of all possible samp e means 
that fall between x* and x* is the same as the proportion less than 

Since the area beyond x* is .05, it is 1.64 ^ from E{x). Thus, 






But, 


Vn 


;* = 2.09 - 


.0328 

Vn 


Hence, 
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This is the relationship that n and x* must satisfy to meet the con¬ 
dition that the probability of leaving the process alone, if , = 2 09, 

•"L.d” tw “ expre “°“ ,or of "» *. 

&* Rations Equations 

.0516 


2.06 


V n 


X* = 2.03 + ^528 
Vn 


2.06 + ^2^ 
Vn 


x* = 2.09 


We can equate each pair to solve for n. 


.0328 

Vn 


2.06 - ^516 = 2 03 ,0328 

Vn Vn 


2.09 - 


.0328 


.03 = 


.0844 


n 

03 


= 2.06 + 
.0844 


.0516 

Vn 


Vn ' w “ vT 

W - 7.9 , or 8 » = 7.9 , or 8 

We can now substitute in either of the equations for x* and z* to 
solve for the critical values. 


x* = 2.06 


- 2.06 


.0516 


= 2.06 
= 2.04 


n 

.0516 

.02 


2.06 + 


= 2.06 + 


.0516 

Vn 

.0516 


Vi 

= 2.06 + .02 
= 2.08 

. Y ° U obs ?™ tllat both sets ^ equations yield the same value 
for « and that both critical values deviate by .02 ounces from 2 06 

V, ! S suffici ent, therefore, to consider equations for either f* or 
a to determine the decision rule. This simplified result follows from 
he symmetrical ways m which we selected states of the world and 

, n .. ’ e n8ks (,H and * were -03 ounces below and above and 
both brisks were set at .05). This symmetrical formulation of the 

mffinW r iP ! S the , deterrnlna tion of a decision rule and is 
sufficient for most practical purposes. 

l su “ mar y> the statistical decision rule for the problem formu- 
ated m Example 8.2 is as follows: Select a simple random sample 
of 8 jars and determine the average weight of their contents. If this 
average w ei ght is between 2.04 and 2.08, let the process continue to 
run. If the average is equal to or less than 2.04, or if it is equal to or 
greater than 2.08, stop the process. 
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In general a procedure used to determine a decision rule for a 
two-tailed testing problem is as follows: Focus attentmn on thre 
«+ a tes of the world, ih (the null hypothesis), and h and ^ (the al 
SStelSXd.) I-t * and , 2 be 

rSUJ, nd *• i» the following eqo.t.o»s: 


X* = IM) — Za Qx 
2 

X* = Ml + ^ ^ 
and 

x* = Mo + z« <** 

2 

x* = /i<2 — zp (Jx 

Remember that a, is the value of the normal deviate for an area m 
the tail equal to one half of a, and a* is the value of the normal devi- 

o+p for an area in the tail equal to ji. 

The same procedure and set of equations can be adapted fo 
JSiSd testing problem in which , is the dccw.on-p.ramcter by 
substituting *• for /», p* for x*, p* for x , and <r p tor <7*. 

8.4 Operating Characteristic Curves 

' To develop a decision rule we had to concentrate on two possible 
values^of the decision-parameter. Thu,, w, centered our attontjon 

„„ = .26 and = .40 in P = f»“» the 

SelnZ ti,rrS.'of incorrect actio.we 
were willing to take on the assumption 

familieS migM prefe v he 
new scouring powder. There would be risks associated with taking 

action on the basis of the derived decision rule if any of these val¬ 
ues of the decision-parameter were to describe the true s a e o 

W °Thus we want a method of indicating the probability of taking 
a certain action for a given decision rule for all possible values 
of a decision-parameter. A statistical way of presenting these prob¬ 
abilities usually is called an operating characteristic curve or, more 
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briefly, an OC curve. It also has been called a performance charac¬ 
teristic curve. 

Let us now determine the OC curve for the decision rule derived 
in connection with the marketing of a new scouring powder. To de¬ 
termine an OC curve it is customary to compute the probabilities 
of taking the action which may lead to a Type II error (the prob¬ 
abilities of accepting the null hypothesis). We, therefore, shall con¬ 
sider the probability of not marketing the new product under the 
decision rule for all possible values of These values can range 
from .00 to 1.00. We now recall the provisions of the decision rule: 
If m a sample of 80 family units the value of p is equal to .33 or 
less, do not market the scouring powder; otherwise market it. 

We can determine two points on the OC curve very easily. If 
- 00 ’ the probability of not marketing the product is 1.00 since 
every value of p will be equal to .00—less than .33. If w ==. 1.00, 
we will always market the product and, hence, the probability of 
not marketing it is .00. Two other points follow from the «- and 0- 
nsks set in deriving the decision rule. The risk of marketing the 
product if *■ = .25 (a-risk) was set at .05. Therefore, the probability 
of not marketing the product if = .25 is .95. The probability of 
not marketing the product if = .40 (0-risk) was set at .10. 

A fifth point that can be determined easily is the probability of 
not marketing if «■ is equal to .33—the same as the critical value 
p . The sampling distribution of p if tt = .33 is normally distributed 
with its mean equal to .33. Since the normal curve is distributed 
symmetrically about its mean, half of the possible values of p are 
equal to .33 or less. Hence, under the decision rule the probability of 
not marketing the scouring powder if «• = .33 is .50. We summarize 
below the five points of the OC curve just derived: 


.00 

.25 

.33 

.40 

1.00 


Probability of 
Not Marketing 
. ... 1.00 

.95 

.50 

.10 

.00 


We now shall illustrate how to determine the probability of not 
marketing the product for other values of tt; for example, ir = 
.36. The statistical problem is to determine, given tt = .36, the prob¬ 
ability of getting a value of p equal to .33 or less. We have dealt 
with problems of this type in Chapter 6, when we demonstrated 
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how to measure risks by means of the normal curve. However, let’s 
review the procedure. The probability in question is illustrated in 
the diagram below: 



To determine the shaded area we compute the normal deviate, 


But, 


z — 


P — TT 


_ l W ( 1 ~ 

~ \ n 

/ (•36)(.64) 
“ \ 80 


= .054 


Hence, 


.33 - .36 
2 " .054 

= -.56 . 

When 2 = .56, the table of areas under the normal curve indicates 
that the area in the tail is .2877. Hence, the probability of not 
marketing the scouring powder, if *• = .36, is .2877. Similarly, we 
could compute the probabilities of not marketing the product for 
other possible values of *•. 

Why is an OC curve called a curve? Because the type of informa¬ 
tion we have been computing is usually presented graphically. 
Thus, for the decision rule we have been considering, the OC curve 
is as shown in Figure 8.1. You will observe that the OC curve has 
only one tail. This is a property of the OC curve for any decision 
rule relating to a one-tailed test. 

A little interpretation of the OC curve is in order. It shows con- 
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ditional probabilities. Thus, for example, it indicates that if we use 
the decision rule under consideration, and if *■ = .36, the probability 
of not marketing the new scouring powder is approximately .29 (or 
that the probability of marketing it is .71). Similar statements can 
be made on the basis of the curve for any value of w. The OC curve 
thus gives us a device for evaluating the protection offered by a 
given decision rule against taking incorrect actions. Note that the 


Figure 8.7 



larger the value of *■, the smaller is the probability of not market¬ 
ing the product, and, hence, the greater is the probability of mar¬ 
keting it. We could not have confidence in a decision procedure 
which did not exhibit this characteristic. 

Let us now develop an OC curve for a decision rule in a two- 
tailed testing problem in which the mean is the decision-parameter 
Such a rule was derived in Section 8.3: If, for a simple random 
sample of 8 jars of coffee, the average weight is between 2.04 and 
2.08 ounces, leave the process alone; otherwise, stop the process. 
For an OC curve it is customary to depict the probabilities of tak¬ 
ing the action that may lead to a Type II error. Thus, in this case 
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we shall determine the probability of leaving the process alone 

under various assumptions about p. 

To illustrate the computation of these probabilities suppose that 
we assume n = 2.05. The sampling distribution of the mean will be 
approximated by the normal curve with E(x) equal to 2.05. The 
standard error of the mean is 

°* aB VS 

V8 

= .0071 . 

In developing the decision rule it was stated that the company had 
estimated * to be .02 ounces. It was assumed that this value does 

not vary even though p does. . 

The probability of leaving the process alone is represented by 

the shaded areas in the diagram below: 



Note that the process is left alone if x is between 2.04 and 2.08. To 
measure the probability of leaving the process alone, we must add 
the areas in both central portions. To measure areas we must deter¬ 
mine the normal deviates. The normal deviates are. 

Left Tail Bight Tail 


(Tx 

2.04 - 2.05 _ 2.08 - 2.05 

~ .0071 - 0071 

= -1.41 = 4.23 

When a is 1.41, the area in the tail is .0793. Therefore, the area 

in the left central portion is .4207 (.5000 - .0793). When * = 4.23, 

the area in the tail is practically zero and, hence, the area m the 
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right central portion may be taken as .50. Therefore, the probabil¬ 
ity of leaving the process alone if /* = 2.05 is .9207. 




In a similar manner the probability of leaving the process alone 
given other values of /*, has been computed. You should verify these 
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probabilities. Again observe the symmetry of these probabilities 


about p = 2.06. 


2.02 

2.03 

2.04 

2.05 

2.06 

2.07 

2.08 

2.09 

2.10 


P (Leaving the 
Process Alone X 
. . . .0024 
... .0793 
... .5000 
.. . .9207 
.. . .9900 
... .9207 
. .. .5000 
... .0793 
.0024 


Figure 8.2 (p. 253) depicts this two-tailed OC curve. 

We turn now to an important consideration with respect to OC 
curve s—the effect of a change in sample size upon such curves. To 
illustrate this effect consider the following situation. 

Example 8.3 A company accepts or rejects lots of 10,000 machine 
parte on the basis of the following decision rule: Select a simple ran¬ 
dom sample of 200 parte. If 1 per cent or more (2 parte or more) are 
defective, reject the lot; otherwise, accept it. _ ... 

The operating characteristic curve for this decision rule is shown 
in the graph below. Notice that *, the proportion of defectives in a 
lot, is the decision-parameter. 
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In contrast, suppose that the company modified its decision rule 
to require a simple random sample of 1,000. Its rule then would be- 
Select a sample of 1,000 parts. If 1 per cent or more (10 parts or 

for r them e nH, e fi fe ? 1V( i’ rejec * the lot; other wise, accept it. The OC curve 
lor the modified rule would appear as: 



pb+m fW -f fu thlS , SeC °r nd - CUrVe 1S stee P er than the first. This indi¬ 
cates that if the value of is low, the probability of accepting a lot is 

larger with the second rule than with the first—that is the risk of 
rejecting a good lot is smaller. Furthermore, if the value’of v is high 

Sh P £ W 7 f - Ce l ing . a , lot “ smaller with the rule than 

short fl ^~' that 1S ’ the n u s k ° f accepting a poor lot is smaller. In 
rnlp Vi,- °? 0 ^ U ' Ve Sh ° WS that both risks decrease with the second 

“ i h S 3 S ° •! c ° m ™ 0 l 1 sense - The second rule involves a larger 
sample. Hence, it should involve smaller risks. 

Thus, as our illustration indicates, the larger the sample, the 
smaller are all risks. And this reflects itself in a steeper OC curve. In 
tact suppose that we took a complete count of all 10,000 machine 
parts m a given lot and rejected the lot only if 1 per cent or more 
were defective The OC curve for this decision rule would appear as 
drawn on the following page. 
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This OC curve, based on a complete count, may be considered a lim¬ 
iting case. It represents a decision procedure characterized by cer¬ 
tainty: all lots with x equal to .01 or less have a probability of one of 
being accepted; all others have a probability of zero of being ac¬ 
cepted. This limiting case of the OC curve can occur only if com¬ 
plete information is obtained. If sample information is used as a 
basis for decision, there always is some risk of acting incorrectly. 
The OC curve obiectively depicts these risks for all possible values 


of a decision-parameter. 

In conclusion, it may be noted that the OC curve often is neces¬ 
sary to evaluate the results of secondary studies. For example, sup¬ 
pose that some researcher in the field of education recommends, on 
the basis of a sample study, that method A of teaching basic sta¬ 
tistics be preferred to method B. Among the information about the 
study that we should seek is the decision rule which led the re¬ 
searcher to his conclusion. Since his conclusion was based on sample 
information, there are risks that his recommendation was wrong. 
Hence, the OC curve which depicts these risks is an important con¬ 
sideration in our decision on whether or not to follow the recom¬ 
mendation. This choice depends on the risks involved and our atti- 
tude towards risk taking. 
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8.5 Acceptance Sampling 

The theory that we have developed in this chapter has been ap- 
p led most effectively to decisions on managerial problems in the 
area of inspection. Inspection refers to the function of checking the 
quality of some work already done. Problems of inspection arise in 
different areas of the business world. Thus, in manufacturing we 
have: (1) the inspection of purchased materials, parts, and sup¬ 
plies; (2) the inspection of work at various stages of the manufac¬ 
turing process; and (3) the inspection of the finished product. 
Checking the quality of incoming plastic material or inspecting 

a lot of finished electrical sockets represent such inspection situa- 
tions. 

. However, the word “inspection” is also used to refer to the check¬ 
ing or verification of clerical work and records. Thus, reviewing an 
inventory count or checking payroll vouchers illustrate clerical in¬ 
spection problems. 

The statistical techniques designed to cope with problems of in¬ 
spection are known as methods of acceptance sampling. Acceptance 
sampling procedures are designed to take action on aggregations of 
similar product-called a lot. The lot can be a case of milk bottles, a 
bale of cotton, a shipment of wire, the entries in an accounts pay¬ 
able ledger, or a group of punch cards. 

The alternative courses of action in acceptance sampling situa¬ 
tions are described as (a) accept the lot, and (6) reject the lot In 
industrial or clerical problems the action, to accept a lot, usually 
means that no inspection is needed. The action, to reject a lot, in in¬ 
dustrial problems, may mean: (1) send the lot back to the supplier, 
(2) completely inspect the lot, (3) rework the lot, (4) scrap the lot' 
or (5) sell the lot as second grade. The nature of the problem in each 
case dictates the particular type of action to be taken. In clerical 
operations, rejection usually means completely checking or verify- 
ing all items in a lot. 

To apply acceptance sampling, it is customary to define two 
states of the world—good lots and poor lots. We can portray an ac¬ 
ceptance sampling decision problem as follows: 


Alternative 

| State of the World 

Good Lot 

Poor Lot 

Accept the lot.... 

Correct 

Incorrect 

Incorrect 

Correct 

Reject the lot. 
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The errors, therefore, are to reject a good lot (Type I error) or to 
accept a poor lot (Type II error). The errors have been so desig¬ 
nated to achieve uniformity. In addition, the probability of re¬ 
jecting a good lot—the a-risk—is known as the producer's risk. The 
probability of accepting a poor lot-—the /3-risk—is known as the 
consumer’s risk. 

Generally, the producer’s risk is set at a lower value than the con¬ 
sumer’s risk. In other words, the decision maker is less eager to reject 
good product than to accept poor product. This is a result of analyz¬ 
ing the consequences of these risks. In clerical operations, it means 
that a department head is less eager to accuse an employee unjustly 
of doing poor work than he is to let a poor worker go by unnoticed. 
The reasons for this should be obvious. In industry it means that it 
is less desirable to delay a production schedule and to incur the 
wrath of a supplier than to rework poor product. 

As in any testing problem, we must describe states of the world 
in terms of a decision-parameter. The decision-parameter used 
most often in acceptance sampling is the population proportion, 
7 T— though p., a, and other decision-parameters are used occasionally. 
We must focus our attention upon two values of the decision- 
parameter—one value descriptive of good quality and the second 
value descriptive of poor quality. These values often are given 
special names. Thus, when is the decision-parameter, one value 
(*r 0 ) is called the acceptable quality level and is referred to as the 
AQL. It represents that proportion of defectives in a good lot—a 
lot which a decision maker is willing to reject with a probability 
of a. The second value (^i) is the lot tolerance per cent defective 
which usually is referred to as the LTPD. It represents the propor¬ 
tion of defectives in a poor lot—a lot which a decision maker is 
willing to accept with a probability of ft. 

Example 8.4 A television manufacturer receives shipments of cut 
wire in lots of about 4,000 pieces. For each lot, he must decide 
whether to accept it or to reject it. A piece of wire (about 10 inches 
long) must be of a certain resistance to be usable. Therefore, a wire 
is either acceptable (if it can be used) or defective (if it cannot be 

used). .... . ., , 

Suppose the company sets the following objectives m its accept¬ 
ance sampling plan: 

AQL: to = 2% a = .05 
LTPD: n = 6% ft = .10 

This means that it is willing to use a sampling plan which will 
achieve the following: 
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. L 7 / a shipment of wires is 2 per cent defective, the probability 
it will be accepted is .95 and the probability it will be rejected (a) 
is .05. This 2 per cent (tt 0 = .02) is the acceptable quality level. 

2. If & shipment of wires is 6 per cent defective, the probability it 
will be rejected is .90 and the probability it will be accepted (/?) is 
.10. This 6 per cent (tti = .06) is the lot tolerance per cent defective. 

Table 8 .7 

TABLE FOR DETERMINING 
ACCEPTANCE SAMPLING DECISION RULES 
ASSUMING SIMPLE 
RANDOM SAMPLING 

a = .05 jS = .10 


R 

c 

nirq 

45.10 

0 

.051 

10.96 

1 

.355 

6.50 

2 

.818 

4.89 

3 

1.366 

4.06 

4 

1.970 

3.55 

5 

2.613 

3.21 

6 

3.285 

2.96 

7 

3.981 

2.77 

8 

4.695 

2.62 

9 

5.425 

2.50 

10 

6.169 

2.40 

11 

6.924 

2.31 

12 

7.690 

2.24 

13 

8.464 

2.18 

14 

9.246 

2.12 

15 

10.040 


Source: F. E. Grubbs, “On Designing Single 

Sampling Inspection Plans,” Annals of Mathemat¬ 
ical Statistics, Vol. XX, (1949), p. 256. 

Instructions for Use 

1. Calculate R — tti/tto (the ratio 
of the LTPD to the AQL ). 

2. Find R in the table above. If it 
is not there, use the next smaller 
value shown. 

3. Read off c. This is the acceptance 
number. 

4. Read off mr 0 and divide by the 
specified 7r 0 (the AQL for the 
problem) to get n, , This is the 
required sample size. 

5. The statistical decision rule is: 
Select a simple random sample 
of size n , and accept the lot only 
if there are c or fewer defectives 
in this sample. 
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Since we have a, p, * 0 , and r 1; we could determine the decision 
rule for the above illustration by the methods developed in Section 
2 of this chapter. If we were to do so, we would find that n should 
be about 200 and p* about .04. Thus, the decision rule is as fol¬ 
lows: Select a simple random sample of 200 wires. If p - -04 or 
more, reject the lot; if p is less than .04, accept the lot. . 

In acceptance sampling instead of expressing the critical value 
in terms of p we usually convert p* into an acceptance number (c). 
An acceptance number is the maximum number of defectives we 
can observe and yet accept the lot. In the decision rule stated 
above, p* is .04. In a sample of 200 this represents 8 defective units. 
Since we would reject a lot if a sample of 200 units had 4 per cent 
or 8 defectives, the acceptance number is 7—we would still accept 
the lot if c were 7. This decision rule can be very simply written 
as ( n _ 200 , c = 7). This is a concise way of saying: Select a simple 
random sample of 200 items. If 7 items or less are defective, accept 
the lot; if 8 items or more are defective, reject the lot. The use of 
the acceptance number obviates the necessity of converting the 
number of defectives into a proportion in order to apply a decision 

It would be an arduous task if a firm had to use the method of 
Section 8.2 to determine a decision rule for each of its many inspec¬ 
tion operations. Therefore, various general tables have been pre¬ 
pared to facilitate the determination of decision rules. One such 
table and instructions for its use are presented in Table 8.1. This 
table is based on a producer’s risk (a) of .05 and a consumer’s risk 
(0) of 10 Other tables are available for different values of » and p. 

We now illustrate the use of Table 8.1 (p. 259) to formulate a 
decision rule. 

Example 8.5 The work of key-punch operators is to be checked 
in batches of 2,000 cards. A decision must be made on whether to 
send the cards out to be tabulated “as is” (accept the lot) or to check 
all of the 2,000 cards (reject the lot). . . 

An acceptance sampling plan is used to make the decision, lhe 
following objectives are set for the decision procedure. 

AQL: in, = .005 a = .05 
LTPD: in = .025 p = .10 

The statistical decision rule to meet these objectives (assuming sim¬ 
ple random sampling) can be found from the Table 8.1. The value o 

R is 


7n/7ro = .025/.005 = 5 . 
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The next smaller value in the table is R = 4.89; c is 3, and is 
1.366. Therefore, 


n = 


1.366 

.005 


273, 


or 275, for administrative convenience. 

The statistical decision rule is: Select a simple random sample of 
275 cards. Send the batch of 2,000 cards to be tabulated if there are 3 
or fewer incorrectly punched cards in the sample; otherwise check all 
of the 2,000 cards. ’ 

The operating characteristic curve for this decision rule appears as 
follows: 



Note that at tt 0 — .005 (the AQL) the probability of not checking the 
cards is approximately .95 (and of checking them is .05). At = 
*?n 5 LTPD ) tlie probability of not checking is approximately 
.10. This is as.it must be, for we set the conditions, a = .05 and 
P — .10, m stating the objectives of the sampling plan. 

Suppose that we now review the basic principles of how to set 
"i’ “< and P- These principles have been implicit in our work, 
but let’s make them quite explicit.—even going so far as to number 
them: 
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1 In selecting values for a, p, tt 0 , and *i, it usually is easiest first 
to set a = .05 and p = .10 (or a = .01 and p = .05, or some other 
pair of standard values). Then one should ask himself what value 
of 7 T he wishes to associate with a .05 risk of rejecting that lot; the 
answer is the value of tt 0 , or the AQL. Similarly one should ask 
himself what value of tt he wishes to associate with a .10 risk of 
accepting that lot; the answer is the value of v lt or the LTPD . 

2. In setting the AQL and the LTPD , as well as « and P, one 
must consider the consequences of possible decisions. The following 
questions must be considered: What are the consequences of ac¬ 
cepting poor lots? What are the consequences of rejecting good 
lots? What is a poor lot? What is a good lot? 

3. As a general rule, the acceptable quality level should be the 
quality level for which management is aiming. An acceptance sam¬ 
pling plan which has a good chance of rejecting lots of lesser quality 
ma y persuade vendors to ship lots of the acceptable quality. 

4. The lot tolerance per cent defective set by management must 
not be an average quality or a quality level to which it is indifferent. 
LTPD should represent a relatively poor or barely tolerable quality 

level. 

5. Both the AQL and the LTPD should be possible quality 
levels For example, suppose that the lot tolerance per cent defec¬ 
tive is set at .08 with a p-risk of .10. Then, if a lot as much as .08 
defective is received, the chance of accepting it will be .10. How¬ 
ever, if there is no chance that such a lot will be presented, the 
sampling plan is giving protection against something that will 
never occur! The plan is not the right one for the decision problem. 

6. The values of tt 0 and v 1 must not be too close. If they are, a 
large sample size will be necessary to achieve the specified risks. 
Often, in practice, after setting % and ^ management may find the 
required sample size uneconomical. The original goals may then be 
revised. 

7. It must be recognized that the producer’s risk («X and the 
consumer’s risk (p) are both risks to the decision maker. If a con¬ 
sumer rejects lots of good quality too often, he is committing the 
error associated with the producer’s risk. However, these mistakes 
ultimately will be passed back to him in terms of bad will and 
higher prices. 

8. If a series of lots are received over a period of time, it may be 
profitable to revise the sampling plan periodically. 
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8.6 Other Acceptance Sampling Plans 

The acceptance sampling plans that we have been discussing are 
known as fixed-sample size or single sampling plans—a decision to 
accept or to reject a lot is reached on the basis of only one sample 
At times, however, double, multiple, and sequential sampling plans 
are used. 

Under a double sampling plan a first sample is drawn and one of 
the following actions is taken: (a) accept the lot, ( b) reject the lot, 
or (c) take a second sample. If a second sample is selected, the al¬ 
ternatives then are to accept or to reject the lot. Thus, under a dou¬ 
ble sampling plan, an ultimate decision on the lot may be reached 
after the first sample is inspected but must be reached after the in- 
spection of the second sample. 

A multiple sampling plan simply is an extension of double sam¬ 
pling. Provision is made for the possibility of several samples After 
each sample is inspected, a final decision on the lot is made or an¬ 
other sample is selected. However, an action to accept or reject the 
lot must be taken after a predetermined number of samples have 
been selected. 


Under one type of sequential sampling plan, items are inspected 
one at a time. After each inspection a decision must be made to ac¬ 
cept the lot, to reject the lot, or to select another item. In other se¬ 
quential plans, sequences of groups of items rather than one item at 
a tune are inspected. 1 In either event it is not possible to determine 
m advance the maximum number of items that must be sampled to 
reach a final decision on the lot. At times, therefore, a sequential 
plan may be truncated—i.e., a decision must be reached after a pre¬ 
determined number of samples. Such truncated plans simply are a 
type of multiple sampling. 

For a given *■„, w u a-risk, and /3-risk, the single, double, multiple, 
and sequential sampling plans can be determined by reference to 
tables which are similar to Table 8.1 and which have been devel¬ 
oped by statisticians. Each plan so found provides approximately 
the same protection against incorrect actions. That is the oper¬ 
ating characteristic curves for the four sampling plans are almost 
identical. 


i .y? theory underlying sequential sampling plans was first developed in 1943 
by Abraham Wald (1902-50), a mathematical statistician who was responsible to 
many important contributions to modem statistics msponsible lor 
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Table 8.2 presents similar single, double, and multiple sampling 
plans. For each plan the AQL is 4 per cent and the LTPD is 9 per 
cent. Also, in all cases, a = .05 and (3 = .10. For each sample that 
might be selected, an acceptance number and rejection number is 
listed. If the number of defectives is equal to or less than the. ac¬ 
ceptance number, the lot is accepted. If the number of defectives 
is equal to or larger than the rejection number, the lot is rejected. 
If the number of defectives is greater than the acceptance number 

Table 8.2 


SINGLE, DOUBLE, AND MULTIPLE SAMPLING PLANS 

7To ” .04 7T1 = .09 

a = .05 (3 = .10 


Type of 
Sampling 

Sample 

Sample 

Size 

Combined Samples 

Size 

Acceptance 

Number 

Rejection 

Number 

Single 

First 

225 

225 

14 

15 

Double 

First 

150 

150 

9 

24 


Second 

300 

450 

23 

24 

Multiple 

First 

50 

50 

1 

6 


Second 

50 

100 

3 

y 


Third 

50 

150 

7 

13 


Fourth 

50 

200 

10 

16 


Fifth 

50 

250 

13 

19 


Sixth 

50 

300 

16 

22 


Seventh 

50 

350 

19 

25 


Eighth 

50 

400 

24 

25 


By permission from Columbia Statistical Research Group, Sampling Inspection (New York: 
McGraw-Hill Book Co., Inc., 1948), p. 330. 


but less than the rejection number, another sample is selected. 
Both the acceptance and the rejection numbers refer to the number 
of defectives in the combined samples. Thus, for example, consider 
these numbers for the fifth sample of the multiple sampling plan. 
If a fifth sample need be drawn and if 13 or less of the 250 items 
selected for the five samples are defective, accept the lot. If 19 or 
more of the 250 items are defective, reject the lot. If from 14 to 18 
items are defective, select a sixth sample. 

Note that under double sampling an ultimate decision on the lot 
must be reached after the second sample is inspected. With the mul¬ 
tiple sampling plan under consideration, an ultimate decision must 
be reached after the eighth sample is inspected. This is apparent for 
the rejection number is one larger than the acceptance number. 
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When ^queiUml sampling is employed, it often is simpler to 
present the decision rule as a chart. Figure 8.3 presents a sequential 
sampling plan based on the same ^ a-risk, and /3-risk as the 
plans presented m Table 8.2. The acceptance and rejection num¬ 
bers are given by the following equations: 


A„ = -2.6045 + .0618m 
R„ = 3.3439 4- ,0618n. 

A n and R n are the acceptance and rejection numbers, respectively 
for a cumulative sample of n items. These lines divide Figure 8.3 
into three regions: an acceptance region, a rejection region, and a 


Figure 8.3 



SteL f t Urtl T IT 1 * 11 ?' After 6aCh it6m iS inSpected ’ a point 

s plotted at a height equal to the combined number of defectives. 
. °, r e ^ am P^ e > one P oi nt (D) is marked on the chart. This point 
indicates that after the thirtieth item is inspected, 3 of the 30 
items are defective. Since this point falls between the two lines 
sampling is continued. If a plotted point falls on or above the 
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upper line, the lot is rejected. If a point falls on or below the lower 
line the lot is accepted. 

Note that under this plan, a lot cannot be accepted until more 
than forty (43) items have been inspected. This is the first time t a 
it is possible for a point to fall into the acceptance region. This how¬ 
ever is less than the smallest sample size, 50, required to reach a de¬ 
cision under the equivalent multiple sampling plan. It is much 


r* _ O J 
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sampling plans is that they may require a smaller sample size, on the 
average, to reach a decision on a lot. For a single sampling plan, the 
sample size is fixed no matter what the quality of an incoming lot 
may be. Under the other plans, the number of items needed to reach 
a decision varies. It is possible, therefore, to reach a decision under 


Figure 8.5 



these plans with a smaller average sample size than under single 
sampling plans. In particular such economies will occur whenever 
incoming lots are either of very good or of very poor quality. A small 
sample, on the average, will be sufficient to detect such extremes in 
quality. For lots of intermediate quality, larger samples may be re¬ 
quired to reach an ultimate decision. In Figure 8.5 we present curves 
which depict the relationship between the average sampling num- 
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ber and the quality of incoming lots for the four sampling plans con¬ 
sidered in this section. The relationships presented are fairly typical. 
Sequential sampling uses the smallest average sample size. Gener¬ 
ally speaking, multiple, double, and single sampling follow in that 
order. Note, however, that for lots of a certain quality (from 5 to 
10 per cent defective) the average sample size for the single sam¬ 
pling plan is less than for double sampling. 

A second advantage claimed for double, multiple, and sequential 
sampling plans is their psychological appeal. Suppliers often feel 
that such plans are more equitable since more than one chance is 
given a lot before a possibly adverse decision is made. Such beliefs, 
however, are erroneous for actually vendors are given the same pro¬ 
tection under any plans for which the OC curves are alike. 

Shortcomings of double, multiple, and sequential plans arise 
from the administrative difficulties they present. They involve more 
record keeping than single sampling plans. Hence, it is more difficult 
to train inspectors, and it increases the likelihood of clerical errors. 
Under single sampling plans the sample size per lot is constant, and, 
hence, it is easy to set up inspection schedules. Under the other 
plans sample size varies from lot to lot, and inspectors may be over¬ 
worked at times and underworked at other times. Finally, for dou¬ 
ble, multiple, and sequential plans physical problems of selecting 
more than one sample may exist. Thus, if all the samples that might 
be needed under a multiple sampling plan are drawn before inspec¬ 
tion begins, too many items might be selected and costs incurred 
needlessly. However, if samples are drawn as needed, the inspection 
department might become cluttered with lots in the process of in¬ 
spection. In general, then, the pertinent facts surrounding a given 
inspection situation will dictate the appropriate method of sam¬ 
pling. 

8.7 Management Objectives and Decision Rules 

In the previous sections of this chapter we considered the deter¬ 
mination of decision rules to provide management with the requi¬ 
site degree of protection against the two types of possible errors in 
two-action problems. Management sets its goals in terms of an 
a-risk associated with one state of the world and a /3-risk associated 
with a second state of the world. Statistical theory then enables us 
to determine the sample size and the critical value of the statistic. 

However, in setting its objectives in this way, management gives 
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up its control over the sample size—an important factor in the cost 
of a sampling procedure. Once the risks are set, the sample size fol¬ 
lows directly from the statistical techniques employed. The sample 
size obtained in this manner gives management the desired protec¬ 
tion but at times its cost may be prohibitive. Then, again, the size 
of the sample may be so large that the decision procedure cannot be 
employed expeditiously. Thus, consider a situation where it is nec¬ 
essary to check a production process at regular intervals to decide 
whether or not the process is operating satisfactorily. If the inspec¬ 
tion process is a lengthy one—for example, involving laboratory 
tests—a large sample may be too time consuming. The damage may 
be done before the sample results are made available. 

. Under such circumstances it may be desirable to set the sample 
size m advance. However, in this case, we can control the proba¬ 
bility of only one type of error. In other words, for the advantage of 
being able to control the cost or expeditiousness of a sampling pro¬ 
cedure, the control over one type of error is sacrificed. Therefore, we 
must look for a decision rule which will make use of the desired sam¬ 
ple size and give us the desired protection against the type of error 
we are more eager to avoid. Subsequent to the determination of the 
decision rule, risks of other errors also may be computed. 

To illustrate the determination of a decision rule under such cir- 
cumstances, consider the following example. 


Example 8.6 A company samples 4 steel rods every half-hour and 
determines their average tensile strength. On this basis, it decides 
whether a process is or is not operating satisfactorily. Past experi¬ 
ence has indicated that a desirable process average is 600 pounds 
per square inch, and the process standard deviation is 20 pounds per 
square inch. It is willing to run a risk of .005 of stopping the process 
when it is operating satisfactorily. Thus, the company is more eager 
to avoid the risk of stopping the process unnecessarily rather than 
leaving the process alone if it is not operating satisfactorily. 

_ tllat l,lle decision rule is to attain are n = 4 and 

“r j-I f, he rule must specify the critical values of x. The sam- 

ri^rih dl t lbUtl iT iG lf M = 600 wil1 be a PP rox ™ated by a normal 
distnbution with E(x) = 600. The standard error of the mean is 
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The sampling distribution and the area conforming to an a-risk of 
.005 are shown in the diagram below: 



The diagram indicates that 

x * = n - z<h and x* = n + ■ 

Since the area in each tail is .0025, z = 2.81. Hence 

2 _ goo - (2.81)(10) and x* = 600 + (2.81)(10) 
= 572 = 628• 


The decision rule is: Select a simple random sample of 4 items. If 
x is between 572 and 628, leave the process alone. If x is equal to 572 
or' less, or if x is equal to 628 or more, stop the process. We could 
determine the OC curve of this decision rule if it were desirable to do 
so. This would indicate the risks of incorrect decisions not only lor 
,l = 600 but for all other possible values of p. 


There are, however, other goals which management might set 
for a decision rule. Consider an acceptance sampling situation where 
the alternative courses of action are to inspect and not to inspect the 
lot. Under such circumstances, management may require a decision 
rule that would keep total inspection costs of good lots at a mini¬ 
mum. The total cost includes the cost of sample inspection and the 
cost of inspecting lots that are not accepted. But, to obtain this ob¬ 
jective, management must again give up control over the ris o 
committing one type of error. We shall not discuss the development 
of decision rules when the objectives of a decision rule are set m this 
way However, these objectives are very common, and tables have 
been worked out to facilitate the determination of decision rules. 
There are tables to provide minimum inspection of good lots and 
control over the probability of accepting poor lots. You will find ref¬ 
erences covering such tables and their use at the end of the chapter. 

We have not listed all the goals management might set for a de¬ 
cision rule. You are aware, however, that there are several. Further- 
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more, in two-action problems, it is possible to control only two inde¬ 
pendent objectives. In setting goals it is, therefore, most important 
to determine which objectives it is most economic to control. 

8.8 You Should Now Know That 

A statistical testing problem involves a choice between two or 
more qualitative courses of action—the decision to be based on a 
sample of observations. 

To treat a testing problem statistically states of the world must be 
described by a decision-parameter. 

Two-action problems are those in which only two alternative 
courses of action are available. 

These problems present the possibility of two types of incorrect 
decisions: called errors of the first kind (or Type I errors) and errors 
of the second kind (or Type II errors). 

By convention, a problem should be formulated so that the more 
serious error is considered the error of the first kind. 

The probability of an error of the first kind is called the a-risk. 
The probability of an error of the second kind is called the /3-risk. 

Both of these risks are conditional upon the existence of a particu¬ 
lar state of the world (a particular value of a decision-parameter). 

It is possible to describe decisions in terms of the acceptance and 
the rejection of hypotheses. 

A hypothesis is an assumption about which state of the world is 
true. 

A decision rule for a testing problem must specify (a) the method 
of sampling, (b) the sample size, and (c) critical values which relate 
all possible values of a statistic to decisions. 

Decision rules may be one-tailed or two-tailed, depending on the 
nature of the problem. 6 

One method of determining statistical decision rules for testing 
problems requires one to focus attention on pertinent states of the 
world, to specify a- and /3-risks, and to solve appropriate equations 
for the sample size and critical value (s). 

An OC curve depicts, for all possible values of a decision-parame- 
ter, the risks of relying on a given decision rule. 

The larger the sample size, the steeper is the OC curve. 
Acceptance sampling is the application of statistical testing to de¬ 
cision problems in industrial and clerical inspection. 

In acceptance sampling the a-risk is called the producer’s risk ; 
the /?-risk is called the consumer’s risk. 
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The two values of the decision-parameter which describe states of 
the world in acceptance sampling often are called the AQL, accepta¬ 
ble quality level, and the LTPD, lot tolerance per cent defective. 

At times it is economical to use double, multiple, or sequential 
sampling plans in acceptance/rejection decision problems. 

In the determination of decision rules, it is possible to control only 
two independent objectives. In setting goals, it is, therefore, impor¬ 
tant to determine which objectives it is most economic to control. 

8.9 You Should Now Be Able to Solve 

1. Would you or would you not suggest a statistical testing pro¬ 
cedure in each of the following problem situations? Explain your 
answer 

a) An auditor must decide whether to approve a payroll for 5,000 
workers or to check the entire payroll for clerical errors. 

b) A cost accountant wants to determine the average waste produced 

in carrying out the operations of a given department as a basis for 
deciding on whether or not to recommend revamping those opera¬ 
tions. , 

c) A company wants to decide whether or not to expand the size ol 

its plant. . . . 

d) A company wants to take remedial action if the proportion of mis¬ 
filed papers of its filing department increases. 

e) Management must make a decision on whether or not to promote 
an assistant buyer to the rank of buyer in a department store. 

2. A company receives glass tubes in lots of 10,000. The propor- 
tion of broken tubes, *•, is considered the decision-parameter. It 
desires an acceptance sampling procedure that will offer it the fol¬ 
lowing protection: If «• = .004, the probability of rejecting the lot 
is to be 05 (o-risk); if *• = .020, the probability of accepting the 
lot is to be .10 (/8-risk). Use Table 8.1 (p. 259) to determine the 
decision rule. 

3. A firm that employs 2,500 people wishes to control excessive 
lateness. Past experience has indicated that, on the average, only 
2 per cent of its employees are late. The firm wishes to maintain this 
average. The firm desires a statistical decision rule which will enable 
it to decide when and when not to take action on excessive tardiness. 

a) What are the two types of incorrect action that might be taken? 

b) Is the population of interest finite or infinite? 

c) What is an appropriate decision-parameter? _ . 

d) The firm decides to use n = 2,500 (all of the employees). Is the 
problem still a sampling problem? Explain. 
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e) Since n is given, only one risk can be specified. Which of the two 
types of error would you desire to control? Explain. 

}) Set a risk, that you are willing to take, of making the type of 
error that you have selected in (e). What factors did youtake 
into consideration m setting this risk? 7 1 K 

g) Is the decision rule one-tailed or two-tailed? 

h) Derive the decision rule. 

Sirve ri Sow? nd §raPh thC ° C f ° r thk ruIe ‘ What does this 

4. A large grocery chain takes four inventories of canned goods 

fheTh 111 ea i h h T St °i e u If there iS a shortage or overage between 
the physical check and the record of the store’s inventory in the 

central office further investigation is to be made. Management 
seeks a statistical decision rule to relate observations to the appro¬ 
priate action. The decision-parameter that is agreed upon is the 
population mean-the average value per physical unit of inventory. 
Assume that <>• = 15 cents. y 

The following states of the world and risks of incorrect decision 
are decided upon: 

m = average value based on the company’s records 
Mi = mo - $.01 

M 2 = fi§ $.01 

a = .01 

P = .05 

a) What is the Type I error in this problem? 

b) What is the Type II error? 

c) Explain the meaning of a = .01. 

d) Explain the meaning of ft = 05 

6) Expt e in dedSi0n mle f ° r thiS Pr ° blem one " tailed two-tailed? 

mkf ValU6S be COmputed to det ermine the desired decision 

^ vaffie) 6 the deCi8i0n mle ( a ssume M equal to any reasonable 

k) T d graph the 0C c orve for the decision rule. How do 

you interpret the meaning of this curve? 

5. A company wants to decide on whether to use iridescent wrap- 

S SonTui? “ PS " n0t Y “ “ “ ted '» 

SLT.“mi"ESr ta P01, "'* li " “ d “ 

slkcfv't mle f b + l a T e - tailed 0r a two -teiled rule? Explain. 

tkn Explain tL decisl o n ~P arame ter upon which to focus atten- 
non. Jiixplain the reasons for your choice. 
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d) What are the possible types of incorrect action in this situation? 

Explain. „ .... , 

e) Set what you consider economical risks of committing each error. 

Explain the reasons for your choice. 

/) Which is the a-risk? Which is the 0-risk? Explain. 
q) Determine the decision rule. 

h) Would it be helpful to determine the OC curve for this decision 
rule? Why or why not? 


8.10 You Will Also Find That 

Key general references on statistical testing, hypotheses, and op¬ 
erating characteristic curves include: 

Neyman, J. First Course in Probability and Statistics , pp. 250-67. New 
York: Henry Holt & Co., Inc., 1950. 

Sprowls, R. Clay. Elementary Statistics, pp. 51-95, 183-206. New York: 
McGraw-Hill Book Co., Inc., 1955. 


For discussions of the applications of statistical testing to indus¬ 
trial inspection decision problems (acceptance sampling), the fol¬ 
lowing references are useful: 

Grant, Eugene L. Statistical Quality Control, pp. 311-38. 2d ed. New 
York: McGraw-Hill Book Co., Inc., 1952. 

Duncan Acheson J. Quality Control and Industrial Statistics , pp. 
131-60. Rev. ed. Homewood, Ill.: Richard D. Irwin, Inc., 1959. 

A survey of acceptance sampling procedures for use in accounting 
onri ;m nlpripfll nnerations is contained in: 


Vance Lawrence L., and Neter, John. Statistical Sampling lor Audi¬ 
tors and Accountants, pp. 27-52, 91-101. New York: John Wiley & 
Sons. Inc., 1956. 



CHAPTER • 9 


Error and Bias 


9.1 Introduction: Sampling Errors versus Procedural Bias 

• ^1°^ , man ? r . y ears a S° one well-known statistician pointed out that 
in the teaching and the learning of statistics “perhaps the funda¬ 
mental thing is to bring the students to the point where they will 
never again look at a number with the same naive, trusting confi¬ 
dence they had as grade school children.” 1 This chapter focuses on 
this idea. 

In the past two chapters we have considered risks—risks of incor¬ 
rect decisions m estimation and in testing problems. These risks are 
the risks of acting on the basis of partial or sample rather than com- 
plete information. They are due to the possibility of sampling error 
—the difference between a sample result and the result that would 
have been derived from a complete count. It is important to re¬ 
member that statistical theory alone provides procedures for select¬ 
ing samples so that one may measure and control the risks due to 
sampling. 

However, it is equally important to recognize that in addition to 
sampling errors there are many other types of possible “errors” that 
may mar the results of a statistical study. These also present risks 
of making incorrect decisions to a decision maker. These other types 
o errors are not due to sampling. They would arise even though a 
complete count were taken. They are biases which result either from 
the way the problem was defined or from an imperfection in the 
method used to elicit information. Hence, they are called proce- 
aural biases or nonsatnpling errors . 


1 Irving W. Burr, "What Principles and 
Taught m High School and Junior College?” 
(1951), p. 12. 


Applications of Statistics Should Be 
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For example, suppose that a decision problem is incorrectly formu¬ 
lated—say, by specifying the wrong decision-parameter. This may 
result in an incorrect decision—it certainly adds to the risks of taking 
the wrong action. This imperfection is not, however, an error of 
sampling. It is a built-in deficiency. It may bias the results of a sam¬ 
ple, but it also may bias the results of a complete count. 

Consider, also, a marketing research study of incomes in a com¬ 
munity. If observations are to be made by requesting families to pro¬ 
vide this information about themselves, the possibility of bias exists. 
Persons who would tend to overstate or to understate their incomes 
might introduce a bias into the results of the study—and this would 
be true whether a sample or a complete count were taken. 

The results of almost every statistical study are subject to some 
imperfections—sampling errors, procedural biases, or, more usually, 
both. We have studied, in the preceding chapters, how statisticians 
have developed procedures which allow the measurement and the 
control of risks of sampling error. In this chapter we will study pro¬ 
cedures for the control of bias. 

Imperfections in numerical information, whether they be the re¬ 
sult of sampling or of a biased procedure, are of obvious importance 
to a decision maker. He cares not why an incorrect decision is made 
—to him only the fact that erroneous information led him astray is 
important. For example, what would be the value of a carefully and 
properly designed probability sample for a marketing research study 
if the questionnaire was so poorly designed that it elicited biased and, 
hence, useless answers? Thus, in designing a statistical decision pro¬ 
cedure we must consider not only the risks of relying on sample in¬ 
formation but also the risks of relying on information that may be 

biased. , 

Nevertheless, the distinction between sampling error and proce- 

dural bias is important, and in speaking of the degree of the trust¬ 
worthiness of data from any study, statisticians use some special 
terminology to point up the distinction. The precision of a proce¬ 
dure is measured by the size of the potential sampling error. The ac¬ 
curacy of a procedure refers to the magnitude of the sampling error 
and the potential bias in information. Thus, to point up this dis¬ 
tinction, note that data collected by a complete count are perfectly 
precise, but they are not necessarily very accurate. Their accuracy 
depends on the extent and the magnitude of procedural, biases. 

Unfortunately, the control of biases is operationally limited be¬ 
cause few special techniques for this task have been developed. In 
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contrast to sampling error, there are almost no known methods of 
measuring the magnitude of biases, although some statistical tech¬ 
niques are available to check for the presence of such biases. In any 
event, management and the statistician always must be alert to the 
possibility of both error and bias. They can control the former by 
using probability sampling. However, generally speaking, the only 
way of controlling biases is by so planning a study that the oppor¬ 
tunities for their occurrence are limited. In order to do this, those 
conducting a sample study or a complete count must be aware of the 
various opportunities for bias. Once aware of these opportunities 
they may prepare for them. 

Thus, in the subsequent sections of this chapter we shall survey 
various types of procedural biases and see how in primary studies, 
proper planning can minimize the opportunities for their occurrence. 
Knowledge of the ways in which biases may occur also is important 
to a decision maker who is using secondary data—information col¬ 
lected for another purpose, perhaps under some other auspices. 
Without knowledge of the accuracy of such data, how can one have 
confidence in them? One should be suspicious of almost all data 
until he verifies whether or not they were collected by a reliable 
and valid procedure . Of course, some data are more trustworthy than 
others, and the auspices under which data are collected often is a 
guide to their accuracy. One need not reject the trustworthiness of 
all numerical information. He merely should be suspicious of almost 
all data—-at least until he is familiar enough with the procedure by 
which they were collected. 

9.2 Improper Formulation of the Decision Problem 

No type of bias is more unfortunate than that which results from 
the improper formulation of the decision problem, per se. Such im¬ 
perfections often result in a statistical study providing the right 
answers to the wrong questions—which only can mean the right de¬ 
cision for the wrong decision problem. Unfortunately, there are sev¬ 
eral specific opportunities for such deficiencies to be built into any 
study. J 

First of all, bias may result from a failure to think through and 
specify carefully enough the essential ingredients of the decision 
problem (the alternative courses of action, states of the world and 
consequences). Failing to recognize exactly what actions can be 
taken and upon what factors these actions should depend can lead 
only to the wrong definitions of the population and of the decision- 
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parameter(s). And as we know, no sample (nor any com P let ^ 0 ™^ 
can provide information about anything but the population fro 
which it was selected. 

Example 9.1 To point up the fact that it is essential that^any 
statistical population of study be defined properly-that it be de 
fined by management and the statistician to include pertinent obse 
vations—one statistician once pointed out the iti 

“Take the case of the manufacturer of dog food . . "lio ™t out 

and did an intensive market study. He tested the demand fo g 
food he tested the package size, the design, the whole advertising 
program Then he launched the product with a big campaign, got i 
propertlistribution channels, put it on the market and had tremen¬ 
dous sales But two months later, the bottom dropped out--no fol- 
fow-un sales So he called in an expert, who took the dog food out to 
the local pound, put it in front of the dogs-and they would not touch 
it. For all the big premarketing study, no one had tried t e pro uc 

on the dogs.” 2 

Thus, the population of study must be specified carefully. The 
definition of an elementary unit must be clear; that of an observa¬ 
tion unambiguous. Unfortunately, this is no a ways' e ^'' 
the problem of estimating total unemployment m the United States 
on a given date. The number of unemployed will depend on the 
definition given “an unemployed person.” Unfortunately econo¬ 
mists differ in their opinions of what is an acceptable definition. 
What one may consider a valid definition, another may regard as im¬ 
proper. Hence, one economist will regard a study as unbiased; an¬ 
other will consider it biased. Needless to say, statistics cannot solve 
this problem; the answer, if there is one, must be provided y eco- 

110 Anadditional problem in some studies is that a satisfactory frame 
which identifies the population must be provided The problem o 
securing an adequate frame is not always simple and can be costly- 
as we saw in our discussion of frames. However, the cost of a good 
frame must always be weighed against the defects in a bad one. We 
h” onsidered earlier how frames may suffer from detoto-^- 
pleteness, duplication, inadequacy, inaccuracy, etc. These defect 
may result in the data collected being seriously biased. Why? The 
frame fails to identify the correct population with a reasonable de- 

gree of accuracy. 

sity of Illinois, 1950), pp. 181-82. 
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Example 9.2 Suppose that the school board of a community 
wishes to determine the attitudes of families with children in school 
towards a suggested modification in their schools’ extracurricular 
programs. A readily available frame is a list of school children. How¬ 
ever, this frame will duplicate the names of several families for some 
families will have two or three or more children in school. 

Thus, the available frame suffers from a defect—duplication— 
which if allowed to stand might result in a bias in the results of the 
study. The potential bias arises from the fact that the frame affords 
some families, perhaps with atypical attitudes, a greater chance to 
appear in, say, a simple random sample from the frame. 

One possible solution is to ignore this bias. Another, obviously 
more costly but often more preferable alternative, is to check the 
frame and sort out the duplicates. In addition, some sampling tech¬ 
niques have been developed so that a sample may be drawn and es¬ 
timates prepared in such a way that the bias is minimized. 

Also note that the suggested frame would be incomplete if one were 
were trying to study the attitudes of all families in a city—families 
without school children would not be identified by the frame. This 
would, more or less, bias the results of a study, depending on its na¬ 
ture and purpose. 

As our discussion in this section indicates, biased information 
may result not from the failure of statistical sampling but from the 
exercise of poor judgment in the formulation of a decision problem. 
This is why many persons have pointed out that management, as 
well as a statistician, must take an active role in the planning of 
any study. Together, these persons, aware of the opportunities for 
bias due to the improper formulation of the decision problem and 
due to other factors as well, may so plan a study as to minimize the 
potential effects of bias. 

9.3 Biased Selection Procedures 

In discussing methods of probability sampling you will recall that 
in all cases we defined a particular method in terms of the probabil¬ 
ities of selection it should yield. For example, simple random sam¬ 
pling was defined as a method of selection which gives every pos¬ 
sible sample of size n the same chance of being the sample chosen in 
a study. Subsequent to these definitions, we considered administra¬ 
tively feasible procedures for selecting samples. Thus, we used 
random numbers to draw simple random and other probability sam¬ 
ples. Implicit in the use of any such procedure is the assumption 
that it meets the probabilistic definition of the method adopted. 
Thus, for example, if we use random numbers in order to select a 
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simple random sample, we are assuming that if the random num¬ 
bers were used repeatedly every possible sample of size ti would oc¬ 
cur approximately the same number of times. 

Unless our modus operandi meets the theoretical definition of the 
specified method of sampling, we say that our sampling procedure is 
biased. If it does not give, for example, every sample combination of 
size n the same chance of being selected when it should, we have a 
biased procedure for selecting a simple random sample. A biased 
sampling procedure is undesirable because it means that our calcula¬ 
tions of sampling risks are based on an erroneous assumption, which 
leads us to think that we know these risks when actually we do not. 

Example 9.3 Consider the problem of checking on the quality of 
a filing process in order to provide a basis for a decision on whether 
or not to install a modern records system. Assume a simple random 
or some other type of probability sample is to be drawn from exist¬ 
ing records and that the records are filed in floor-to-ceiling cabi- 

\f a sample were to be drawn from this system without the benefit 
of random numbers, several opportunities for a biased sampling pro¬ 
cedure exist. Thus, if clerks were asked to pick a random sample from 
the files, there might be an unwitting tendency to avoid the selec¬ 
tion of records in the highest or in the lowest cabinets. Those records 
would be less conveniently available and perhaps their chances of 
appearing understated. But in the records-keeping operation the 
more inconvenient position of these cabinets may result in more fil¬ 
ing errors. For a sampling procedure to be unbiased these “high-low” 
records should be given their proper chance to appear in a sample. 

To eliminate this aforementioned bias, and any others which we 
cannot even suspect, it would be best to use random numbers to de¬ 
termine the elementary units selected for the sample. 

Thus a way to avoid bias in a sampling procedure is to use random 
numbers. Is this a good solution? Statisticians are able to answer 
“yes” on the basis of certain tests that they apply to random num¬ 
bers and on the basis of their experiences. The repeated use of such 
tables has indicated that they bring one very, very close to satisfying 
the various theoretical definitions of probability sampling methods. 

It is well always to remember the danger of bias with judgment 
or convenience sampling. In these cases probabilities of selection 
cannot be ascertained. In fact, probability has no meaning if judg¬ 
ment or convenience rather than a random process is the basis for 
selection. The result is that there is no known way of generally eval¬ 
uating judgment or convenience sampling. Furthermore, many ex¬ 
periences where a later test was available have shown that these 
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likely to occur at least once. Of course if in twentv fi t eS l very 
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line^OthouT^falfetooi in 0 it h0 and aSy * t0 get a h ^ d ' 
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A somewhat similar bias may occur in acceptance sampling. 

Iffhe pSe/fhooTes to 'Ae^Tot^tl^t attempt- 

inS 'irsuwose^LTforTgiven decision rule the risk of accepting a 

times this probability more than doubles since. 

- .200 


.80 X .20 = .160 


P( Acceptance on First Submission) 

P(Rejection on First Submission and 

Acceptance on Second) = 

P (Rejection on First Two Submissions q 2 0 = 128 

and Acceptance on Third) — .80 X .80 X . - 

P( Acceptance on One of Three Submissions) 

The consumer’s risk thus increases rapidly if rejected lots are re- 
submhted Hence, the original risk of .20 becomes meamngles ^Sev¬ 
eral solutions are possible. One of the “ f ay p roduct 

possible, identify and fully check resubmitted lots of product. 

One other type of bias in the interpretation of the results of sann 
nling procedures often is caused by the failure to recognize that 
risks Of committing errors always exist when samples are the basis 
for a decision. Again we quote on the subject of a dentifrice. 

Emmvle 9.6 “Something called Dr. Cornish’s Tooth Powder 
came onto the market a few years ago ThS 

w”“tau“po«te .“33.’™“ S 

nn^ed to have demonstrated to be valuable for the purpose, ine 
pointlessness of this was that the experimental work had been pur y 
preliminary and had been done on precisely six cases . . . 

The poor decision illustrated by this last example rests with 
Dr Cornish’s failure to consider the risks associated with a sample 
of but ^x cases. The risk of marketing the product was obviously 

great. 

9.4 Bias of Nonresponse 

In our previous discussions we have assumed that once a sample 
»,Ltan.hr, unit, is pton.d, it is <*» ™3 

pertinent characteristics of those particular elementary units. 


±lbid., p. 38. 
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is, we have assumed only the possibility of a 100 per cent response to 
any sample. The failure of this assumption is called nonresponse. 

borne nonresponse will occur in most statistical studies. It is par¬ 
ticularly common if a study involves people as elementary units 
ihus nonresponse may arise in such studies from (a) people not 
found at home-perhaps moved, or even deceased; ( b ) a person’s 
telephone being busy, disconnected, or remaining unanswered: 
(c) refusals of persons to give information; (d) peoples’ inability to 
provide the pertinent information; and/or ( e ) people’s failure to 
take the trouble to respond to a questionnaire. Analogous causes re- 
su t m nonresponse m other types of statistical studies. To consider 
a few causes note the possibilities of lost, destroyed, or misfiled rec¬ 
ords outstanding checks, a lot of misplaced parts, library books in a 
bindery and not immediately available, etc. 

N °nres po nse may impair the usefulness of a statistical study The 
mam difficulty rests with the fact that respondents may differ from 
nonrespondents with respect to the characteristic of interest The 
1’fdh res P°f dents in a stud y provide only biased information 
tTestlT “ # The Study Suffers fro “ *e W» of 

itsdfTffi^following'wa?: 08 ^ 011 ° f StatiStiCS nonres P onse reflect ®d 

A sample of 208 manholes belonging to a large utility company 
ppea ing to the court for higher rates was drawn for the purpose of 
inspecting the manholes themselves, the ducts and the cables therein 
It was then discovered that 22 of the manholes designated for the 
sample had been paved over. Civic authorities would not let them be 

TheTffic d 'lt SUbStltUtl iif S i C0U i d n0t be P er ® itted > nor a new sample 

ffiem f hi Y Unfa y orably t0 the com Pany by declaring 

them to be one grade lower than averaged’ 5 ^ 

c^Jte aS y °^ d? ? e ’ if a11 P aved -° ver manholes were dis¬ 
carded from the sample, that part of the population would have been 
given no chance to appear in the sample. 

It is important to recognize that nonresponse may bias the results 
ot an attempted complete count as well as those of a sample. In a 
complete count there will be more respondents, but there also will 
be more nonrespondents, and the relative frequency of them will be 
about the same as if a sample (probability, judgment, or conven¬ 
ience) were taken. Any differences in the characteristic of interest 
between the two groups will lead to biased results no matter what the 
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design of a study. Thus, nonresponse is an equally important con¬ 
sideration in all types of studies. 

Example 9.8 In 1937 the United States Congress appropriated 
funds for a “census of unemployment.” Although a complete count 
was aimed for, provision was made only for voluntary responses. To 
increase the amount of response, the President appealed for co-oper¬ 
ation, a publicity campaign was conducted, and the Post Office De¬ 
partment undertook to leave a questionnaire at almost every door- 

step in the country. , . , , * 

The rate of response was high, at least for this type of survey. A 
subsequent probability sample for which nonresponse was minimized 
was taken in order to evaluate the census returns. It indicated that 
about two thirds of those who should have responded to the census 
actually did so. However, it also indicated that there were several 
significant and substantial biases, due in part to nonresponse, m the 
(attempted) complete count. 

The possible bias from nonresponse is sometimes ignored, and the 
respondents to a statistical study are considered a sufficient sample. 
But what type of sample do they represent? The answer is a con¬ 
venience sample.” As we know, convenience sampling has its appli¬ 
cations. However, providing a basis for an important decision is not 
one of them. Thus, the interpretation of respondents as a sample is 
particularly dangerous when that sample is blindly regarded as 
“representative.” 

One amazingly poor solution to nonresponse is sometimes sug- 
gested—increase the original sample size. Thus, suppose that a 
sample of 100 student answers to a questionnaire was desired and 
it were known, on the basis of past experience, that only about 25 
per cent of the students contacted would respond. Should a sample 
of 400 be drawn on the basis of the arithmetical calculation. 
400 x 25% = 100? The answer is no—-increased sample size will 
result in more respondents and more nonrespondents. Hence, it 
fails in itself, to decrease any bias of nonresponse. It seems that 
the inadequate solution of increased sample size often is based on 
the erroneous idea that any large sample is a good one. (You may 
reconsider Example 4.16 as an aid in debunking this thought.)_ 
What can be done about nonresponse and the bias it may interject 
into the results of a statistical study? The first general answer is try, 
in so far as is possible, to prevent it by a well-planned procedure for 
obtaining information.” 

Example 9.9 The settlement of an excise tax case required that 
the inventory of the taxpayer, who operates several variety stores, be 
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determined The examining agent of the Internal Revenue Service 
suggested that the inventory on a given date be taken so as to in- 

ventorvTrl T? Instmctions to clerks who were to take the in- 
br^nHs oftP, d6d thls /, e< l UI J rement - However, due to poor supervision, 

not ft T er ® " ot lsted on the inventory record sheets. This was 
not discovered until a later date. 

Thus, in a sense the study suffered from nonresponse-some perti- 

SiStf’ JW SUPP0Sedly drawn int0 the -mple,we 
“o wav Ivail d It °/ he mvent ° ry was far in th e Past, and there was 
tualTv a y. aila “ e to , re “°ve the nonresponse. If brands were ac- 

ebm 0 bett6r mven t° r y"taking procedure would have 

eliminated the nonresponse. 

The proportion of nonresponse in any study will depend, in part, on 
controllable factors. Thus, m the sampling of human populations, 
the questionnaire, the interviewers, and the publicity are among 
, 6 terminal!ts of the amount of nonresponse. Tor example, it has 
een o served, in many studies, that some interviewers have per- 
sistently good luck at inducing people to respond, while others have 
persistently bad luck. 

Few statistica! studies will be completely free of nonresponse even 
with the best-planned procedures. However, the proportion of non¬ 
response in a study may be minimized further by allowing for call¬ 
backs or recalls. Some nonrespondents may be turned into respond¬ 
ents by supplementary contact. Two, three, four, or more appeals 
often are used to attempt to eliminate a good part of the original 
set of nonrespondents. Letters, telephone calls, and personal inter¬ 
views may be used, for example, to supplement an original question¬ 
naire. Furthermore, a sample of nonrespondents may make the 
provision for recalls m a survey more feasible. 

Example 9.10 In one statistical study to determine interest in a 
adio program on the topic of child training, an original question- 
nane was sent to a sample of 600 women. About 16.8 per cent re¬ 
sponded. A follow-up questionnaire drew in another 34.1 per cent 
Then a probability sample of the remaining nonrespondents was con¬ 
tacted by telephone, and 97.2 per cent of those called co-operated 
It was found that only those interested in, and familiar with, the 
topic responded to the original questionnaire. Thus, it was pointed 

the ofivto ? W ?° COnducted the study that “if we had stopped with 

ttlce as ™ S 'V w ? uld have been Maimed that almost 

twice as many women knew about the program as really do.” 6 
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The problem of nonresponse—its causes and remedies has been 
plaguing planners of statistical studies for many years. Much re¬ 
search, therefore, has been done in this field, and a vast literature 
exists on the subject. It often is worthwhile to consult someone fa¬ 
miliar with this literature in the planning stage of a statistical 
study. 

9.5 Biased Observations 

In the last section we considered the possibility of missing obser¬ 
vations—i.e., nonresponse. Now we shall turn to a brief discussion of 
the possibility of biased observations—i.e., incorrect responses. For 
example, suppose that we were considering a population of diam¬ 
eters of steel rods. Our discussion to this point has assumed that for 
any rod selected in a sample, its true diameter will be observed. 
Similarly, assume that we are interested in a population of ages of 
employees and that a sample is drawn from this population. Unless 
the true ages are reported, we have a sample including not true 
population values but rather biased observations. 

It is unfortunate that biased observations are more commonplace 
than is generally recognized. They may result from many causes, 
several of which we shall survey in this section. As you shall see, 
some of these factors are more important than others in certain 
types of studies. Whatever their cause, however, it is necessary, to 
keep in mind that biased observations present risks of inaccurate in¬ 
formation and, hence, risks of incorrect decisions to a decision 
maker. Therefore, once again the problem in any statistical study is 
to try to minimize their occurrence by carefully planning the pro¬ 
cedures to be used in making observations. 

Consider, as examples of biased observations, the results , of 
weighing with an “unbalanced” scale or the results of measuring 
with an “inexact” instrument. Such results would be biased be¬ 
cause of an intrinsic imperfection in the method of observation. Note 
in particular that nonsampling errors such as these can be eliminated 
only by a careful testing and analysis of the method of observation. 
They will not tend “to even out” because of any assumed “law of 
averages.” 

In contrast to systematic errors of measurement, it should be rec¬ 
ognized that any method of measurement also will be affected by 
chance or random errors. Such errors may be due to air currents, un¬ 
controllable variations of temperature, vibrations of the measuring 
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instrument, etc. They often vary in a random manner from measure¬ 
ment to measurement and, hence, may be analyzed from the point of 
view of random sampling theory. 

Example 9.11 Consider a decision problem, perhaps involving the 
grading of a batch of a chemical compound for which observations 
are made by laboratory analysis. For any given laboratory procedure 
there will be some chance variation. Thus, if a sample of the com¬ 
pound were tested over and over again in essentially the same man¬ 
ner, different results would be obtained. Why? Because a multitude 
of “chance” factors would influence the results. One test might indi¬ 
cate a 2.31 per cent alcoholic content, a second test might show a re¬ 
sult of 2.17 per cent, a third test a content of 2.25 per cent, etc. This 
chance variation may be regarded as sampling error—the result of 
applying the laboratory analysis only a few of the unlimited number 
of times it could be applied. 

However, unless on the average the laboratory procedure yielded 
the true measurement, we would say that it also resulted in a system¬ 
atic error or bias. That bias would be due not to chance but to an im¬ 
perfection inherent in the method of analysis. Of course, bias, or 
the lack of it, can be determined only by comparing a given method 
of measurement with some other, supposedly unbiased, method. 

Similarly, methods of measuring length, width, height, weight, vol¬ 
ume, and so forth, may result in chance errors and systematic errors. 
Remember, then, that while random errors of measurement may be 
regarded as sampling 1 errors, systematic errors of measurement, 
must be regarded as biases. 

To consider somewhat different types of biased observations, let 
us turn to statistical surveys of human populations. Three simple, 
commonplace, and important causes of biased observations in such 
studies are (a) the questionnaire, (b) interviewers, and (c) auspices. 
Therefore, a brief discussion and a few examples of the influence 
of these three factors will be instructive. 

To elicit accurate information from people, a question must be 
framed carefully. First of all, it must be adapted for the method of 
making the observations in a study—that is, for a personal interview, 
for telephone calls, or for a request by mail. The wording of ques¬ 
tions must be simple and unambiguous. It must not be leading. An 
attractive and clear form in mail questionnaires is important Very 
often the preparation of a good questionnaire requires expert as¬ 
sistance. In most surveys in which a questionnaire is used it is desir¬ 
able to pretest it. The following examples indicate some of the many 
possible deficiencies in questionnaires. 
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Example 9.12 A public opinion survey might request an answer 
to the question: “Are you in favor of changing the filibuster system 
in Congress?” While this question appears reasonable, a study once 
indicated that only 48 per cent of the people questioned knew the 

meaning of the term “filibuster.” t , - 

The moral: In designing questions, use simple words which are la- 

miliar to potential respondents. 

Example 9.13 In a certain study homeless men were interviewed. 
When asked, “Are you married?” most replied, “No.” However when 
the question was asked indirectly as, “Where is your wife? a far 
larger proportion indicated that they were married. Such indirect 
questions often are good ways of getting valid information. 

Example 9.14 Interviewers for the “Monthly Report on the Labor 
Force” originally asked: “Was this person at work on a private or 
government job last week?” The question was interpreted by many 
respondents to exclude part-time workers. The question was then 
changed to: “What was the person’s major activity during the 
week?” When a person’s activity was something other than working, 
the interviewer also inquired whether the person “did any work for 
pay or profit during the week?” . _ , . 

It was found that the new form of questioning increased the esti¬ 
mate of the number of working males by 900,000 and of the working 
females by 700,000. The additional male workers were mainly stu¬ 
dents and the additional female workers were primarily housewives. 
More than half of these had worked more than 35 hours during the 
week in question. 8 

Thus, one always must be on guard to make sure a question ac¬ 
tually is yielding the desired information. 

Too often, it has been found, interviewers influence the observa¬ 
tions being made. Thus, some interviewers may cause, uncon¬ 
sciously, human respondents to take sides with them or to take sides 
against them. Bias attributable to the interviewer may arise from his 
political or social or religious beliefs which he superimposes on 
respondents in the way he asks questions or in the way he treats 
answers. In addition, bias due to interviewers often is introduced 
(1) by improper training; (2) by failure of interviewers to follow 
instructions as to questioning, as to areas to cover, and as to treat¬ 
ment of nonresponse; (3) by failure of interviewers to understand 
the purpose of a study; or (4) by interviewer fatigue. 


7 F. Stuart Chapin, Field Work and Social Research (New York: D. Appleton- 
Century-Crofts Co., Inc., 1920), p. 172. 

s Gertrude Bancroft and Emmett H. Welch, “Recent Experience with Problems 
of Labor Force Measurement,” Journal of the American Statistical Association, Vol. 
XLI (1946), pp. 303-12. 
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Example 9.15 A classic and often-quoted illustration of bias 
arising from interviewers is provided by a 1914 social study of 2,000 
destitute men. The reasons given by many of these men for their sit¬ 
uations varied directly with the interests of their interviewers. Thus 
the responses recorded by an interviewer with socialist tendencies 
showed a strong tendency to place the blame on industrial conditions. 
In contrast, the men interviewed by a prohibitionist showed a strong 
tendency to place the blame upon drink. 9 


To minimize interviewer bias, it is necessary to have a well- 
trained group of interviewers with specific instructions to follow. 
This^ ordinarily is possible, for most problems, only in sample 
studies which require only a small corps of interviewers. Large-scale 
surveys which require a large staff of interviewers increase tremen¬ 
dously the difficulty of training and offer more opportunity for 
variation in the quality of interviewing. This is one reason why, in 
Chapter 4, it was pointed out that samples may result in more ac¬ 
curate information than complete counts. 

At times it has been found that people will report differently, if at 
all, because their attitudes are influenced by the organization con¬ 
ducting the study, or by the purpose of the study. This is called the 
bias arising from auspices. Thus, it has been found that some per¬ 
sons will report differently, either unintentionally or intentionally, 
to studies carried out by governmental agencies and to studies car¬ 
ried out by private research organizations. 


Example 9.16 A number of years ago a census in a Chinese 
province, for military conscription and taxation purposes, found a 
population of only 28 million. Another survey in the province a few 
years later, but for the purpose of granting aid for famine relief, 
iound a population of 105 million. 


Example 9.17 One statistician once related the following incident: 
• remem ^ er a case in California not long ago, where they were 
talking about the closing of banks on Saturday. Two surveys had 
been run. One survey had been run to ask the general public about 
the employment policies of a certain bank. In that survey, they asked 
whether the bank should be closed on Saturdays. Over-whelmingly 
the people said, ‘Sure, yes, they should be closed on Saturdays.’ Ap- 
P ar ently the reason was that it would be better for the employee who 
would get the day off. The other survey talked about services to the 
public; and whether banks stayed open the right hours. They asked 
the same question, 'Should the banks stay open on Saturday?’ Just 


9 Stuart A. Rice “Contagious Bias in the Interview: A Methodological Note” 
American Journal of Sociology, Vol. XXXV (1929), pp. 420-23. g 
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the opposite results came in. The 
open on Saturday to give better 
The results were inconsistent 
frames of reference. 


public said the banks should stay 

service to the public.” 10 

for the respondents had different 


Even if all the afore-mentioned causes of biased observations are 
minimized, inaccurate responses will still occur for other reasons. 
Some responses reflect the lack of ability of people to provide ac¬ 
curate information; some others, the refusal to do so. Thus, biased 
observations may result from memory failures, guessing, accidental 
mistakes in responding, and “prestige bias.” The latter is the term 
used to explain wrong answers arising from a respondent s pride 
his reluctance to admit he reads a tabloid newspaper, or his down¬ 
grading of his age, or his upgrading of his salary. 


Example 9.18 One recent survey of readership and recognition of 
financial and business writers for New York newspapers indicated 
the following response bias. People were asked, among other things, 
if they read the column written by Kenneth Albridge. Of the respond¬ 
ents 5.7 per cent claimed to have read it at least occasionally. 
Several named the newspaper, a feat all the more remarkable because 
Kenneth Albridge was a fictitious columnist. Undoubtedly, some o 
this bias resulted from an accidental mistake in responding; while 
others resulted from an inability to say “I don’t know. The latter 
reflects a prestige bias. Incidentally, the purpose of that particular 
question was to provide some information on response bias. 


Example 9.19 A study on the ability of British government em¬ 
ployees to recall their absences during a year showed the importance 
of the memory factor in statistical surveys. Some employees over¬ 
stated the amount of sick leave taken. Others underestimated it. 
Many others, although they could recall being out, were unable to 
remember when they were absent. 11 


Example 9.20 In the 1950 Survey of Consumer Finances home- 
owners were asked to estimate the market value of their houses. 
Later, estimates of the same houses were made by professional ap¬ 
praisers. The discrepancies between the two sets of estimates were 
large. Thus, 63 per cent of the homeowners’ estimates differed by more 


to j Stevens Stock, “Non-sampling Errors in Sampling for Business,” Proceed¬ 
ings of the Conference on Business Application of Statistical Sampling Mei:ho s, 
(Monticello, HI.: The Bureau of Business Management, University of Illinois, 1950), 

11 Percy G. Gray, “The Memory Factor in Social Surveys,” Journal of the Amen- 
can Statistical Association, Vol. L (1955), pp. 344-63. 



Ch. 9] 


error and bias 


291 


Yet «,me bL »y study. o’nThe b»J,"?ft“r 

past experiences, statisticians and subject-matter experts often may 

ta . S.e»° .tadTl'f 7® df b' POl “ ti<J "■“*“*“* »' a «*> M-eee 
n a gl ven study. If they will be excessive and cannot be controlled 

oeth ‘b? "“r “* y b ' to "r *« .*C- 

to reformulate the problem so that another, more accurate decision 
procedure can be used to solve it. 

9.6 Biases in Data Flow 

+J 11 " ext ° ,)a P tcr we sh all study the details of data flow This 

topic deals with the collection of observations, the processing of 

or subsequent arhhmeticTl 

Thus, one source of this type of trouble in many surveys is a 
disorganized collection procedure. Poor supervision of interviewers 

terviewp 311 ^ S * ratlon ; Such a difficulty may result in ill-trained in¬ 
terviewers and, hence, interviewer bias. Or it may result in observa 

ions being collected twice, or not at all, from an elmentaryTffit' 

fOT the colTeclf 0 a f Sa K PlC ' P °° rreCOrd keepin S or “adequate forms 
for the collection of observations also provides an opportunitv for 

"•anized & n + W ’ ^ an lllustration of th e consequences of a disor- 
poffit °procedure, Example 9.9 may be reread at this 

In some studies it is necessary to edit and to code observations 
after they are collected but before they are tabulated and the 
pertinent statistics computed. For example, suppose that one mar- 

S town “fief the qUeSti ° n: “ What f6atUreS ° f this product 
appeal to you? Before answers are tabulated, they must be coded- 

fitted into various groupings. However, two coders or a coder and 

is supervisor may disagree on the particular grouping to which a 

given answer should be assigned. And, in any event, the opportu- 

m ies for bias from inaccurate coding are evident. Where coding is 

»<"* ^p™. instructions ..“S 

Horned^7*“ Climating the Value of 
38. American btatistical Association, Vol. XLIX ( 1954 ), pp ; 520 - 
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must be given workers. This also is another situation where sampling 
offers an advantage over a complete count— a smaller better 
trained work force to do more accurate work is possible with a sam- 
dIg. 

Arithmetical mistakes in tabulation and in calculation also repre¬ 
sent potential sources of bias in data flow. Such mistakes usually 
should be negligible, but at times they are not. However, they or¬ 
dinarily may be minimized by control totals and other numenca 
checks. Then, too, it usually is possible and desirable to use a sam¬ 
pling plan to control clerical accuracy. 

In conclusion, it should be noted that the insidious thing about 
any and all biases is their constancy and the consequent difficulty of 
detecting them. Furthermore, biases have a tendency to cumulate 
rather than to compensate one another. Our examples in this chap¬ 
ter point up situations where biases were found to exist or where 
they may be expected to exist. In how many other situations are 
they not detected—nor any thought given to the problem of de¬ 
tecting them? 

9.7 You Should Now Know That 

To check the trustworthiness of the results of a statistical study, 
ordinarily one must check the procedure by which the data were col- 
lected. 

Procedural biases, or nonsampling errors, are imperfections in¬ 
herent in the procedure used to elicit information. 

Biases will arise even though a complete count is taken. 

Biases usually are impossible to measure and often difficult to de- 

tect 

In contrast, sampling error is the difference between the result 
from a sample and the result which would have been obtained from a 
complete count taken with the same care as the sample. The risks of 
(probability) sampling errors are measurable. 

Almost every study is subject to some sampling error, some pro¬ 
cedural bias, or, more usually, both. . 

Generally, the only way of controlling biases is by planning a 
study carefully to limit the opportunities for their occurrence. 

Knowledge of ways in which biases may occur is therefore im¬ 
portant. . . . 

Biases may be due to the improper formulation of the decision 

problem—defining the population poorly, specifying the wrong de¬ 
cision-parameter, securing an inadequate frame, and so forth. 
Biases may arise because of an imperfect sampling procedure or a 
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S^‘“ d lhe ™ kS “ ■»!« » »m- 

Bias may result from missing observations (nonresponse) or from 
biased observations (inaccurate responses). 

sample ”“ resp '“'“ “ nnot *» "Ueviated by increasing lhe 

n«re‘an mT™*T T *(™ “ p ” rl >' “s™.! W«o»- 
i ” iU-tramed interviewer, the auspices of the study a re¬ 
spondent’s memory failure, prestige bias, and so forth 

cedurefeuitvS V° W ^ an unor S a ™ed collection pro¬ 

cedure, faulty editing, or coding of responses and arithmetical errors. 

9.8 You Should Now Be Able to Solve 

1. Discuss the following statement from the point of view of the 
subject matter of this chapter: 

A statistical inspection procedure, no matter how perfect it is statist; 
cally, ,s no good without the proper supervision. tlStl_ 

2. In a study of the burden of a gross receipts tax upon the firms 
that Pay the tax, the following question was asked: “Do you pass the 

C0S ,° f t he tax on t0 Purchasers of your products?” In what ways 
could this question lead to biased responses? Explain. 

3. A study of retail trade asks respondents to report sales in 
thousands of dollars. How might this request be a source of bias? 

m bV* h f S if en determined that a sample of 600 firms is needed 
o estimate the potential market for a new lubricant. A mail ques¬ 
tionnaire is to be used. Past experience has indicated that a non¬ 
response of about 25 per cent is to be expected. Therefore a random 

5. A mail questionnaire was sent to a random sample of 4 000 
individuals to determine their attitudes toward a new type of popu¬ 
lar science magazine. Questionnaires were tabulated as they were re¬ 
ceived. The results for the first 600 returns were as follows: 

Questionnaire Proportion Favoring 

Checked ■the Magazine 

600 . .52 

f S fl 1 hnl f , t +e e ih Pr0P0rti0n remained fairl y stable > it was decided not to 
tabulate the remaining questionnaires. Thus, additional processing 
expenses were saved. Pressing 

Evaluate this procedure. 






294 STATISTICS FOR BUSINESS DECISIONS [Ch. 9 

6. The table below shows the relatiw freq^ncy °f age 

reported in the Census of Population for 1880 and 1950. 


Digit of Age 

1880 


16.8 


6.7 

o . 

9.4 

Q . 

8.6 

.. 

| 8.8 

K . 

13.4 

A . 

9.4 

. . 

8.5 

Q . 

10.2 

q . 

8.2 

Total. 

100.0 



Analyze this table for possible indications of bias. Discuss your 

findings. 9 

How might these biases be overcome.'’ . 

7 A simple random sample of 300 adults is to be selected m a 

300 families and to interview one adult m each fam y. 

Is this sampling procedure biased? Explain. . 

S You are to conduct a poll to determine voting intentions i 

terviewed together. 

Evaluate the suggested procedure. , , . 

Q An interviewer is to visit the homes of a randomly selected 
‘ . f families If there is no one at home, he is to visit a farm y 

Ex XZXJXXXZXZZZX - 

not-at-homes. 

Evaluate this sampling procedure. 

9.9 You WiU Also Find That 

Discussion, analysis, and examples of biases are given m: 
Deming, W. Edwards. Some Theory of Sampling, pp. 24-52. New York: 
John Wiley & Sons, Inc., 1950. 

j. Myere “Accuracy o! Age 1 v^XLdJoW), PP- 826-31. 

sus,” Journal of the American Statistical Association, 
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Hansen Moeeis H Huewitz, William N, and Madow, William G 

Wife? & Sons! fnc 1953^ 7W ®'’ ^ PP ’ 5G “ 92 ' New York: Johl) 

PA dd E 403 M 24 D T' v r i eV K P ° llS T A Sam P les: Practical Procedures, 
pp. 403-24. New York: Harper & Bros., 1950. 

An interesting article on bias resulting from incorrectly formu- 
latmg a problem is: 

Kimball A W. “Errors of the Third Kind in Statistical Consulting ” 
I33I42 ° ^ Amencan Statistical Association, Vol. LII (1957), pp. 

For further description of misuses of statistics see: 

H ° W t0 ^ Witk Statistics ■ New York: w - W. Norton & 

Wallis W. Allen, and Robeets, Haeey V. Statistics: A New Av- 
proach, pp. 64-89. Glencoe, Ill.: The Free Press, 1956. 




CHAPTER ■ 10 


Data Flow 


10.1 Data Flow Defined 

In the preceding chapters we have considered the steps necessary 
in the planning of a statistical study. These steps may be summa 
rized as follows: 

1. Identify the problem in terms of alternative courses of action, 
states of the world, and possible consequences. 

2. Define the problem in statistical terms; i.e., define the popu- 
lation of study and specify the pertinent decision-parameter (s). 

3. Determine whether a decision can be reached without the 
benefit of observations or on the basis of available secondary data. 
If not, determine whether the problem requires a complete count, 
a judgment sample, a convenience sample, or a probability sample. 

4. If a probability sample is to be used: 

(a) Set the risks that you are willing to take in relying on 
sample information as a basis for action. 

(b) Determine the decision rule that will keep the risks of 
incorrect action within desired limits. 

(c) Plan the decision procedure so that procedural biases 
(nonsampling errors), as well as sampling errors, will be 

controlled. 

Thus, we have discussed how one determines what type ot mtor- 
mation is wanted, how it will be used, and, moreover, how one may 
plan a statistical study so that when it is carried out, the results will 
serve as a guide to a decision. 

In this chapter we shall survey several of the major steps and 
problems in the implementation of a statistical Study. Once elemen¬ 
tary units are drawn for a specified purpose, data on their character¬ 
istics of interest must be collected, these data processed and tabu- 

296 
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laS IP, St ? tiStiC ! CalCUkted ’ the a PPropriate action 

taken and a final report on the study written. This series of steps 
may be called data flow. It deals with the flow of information from 
the time that the sample of elementary units is chosen to the time 

fra deci e sion SerVatl ° nS ^ PreS6nted in final form t0 serve a s guides 

Unless the flow of data is efficient and effective, the cost and the 
time necessary to conduct a study are enormously increased. Fur- 
thermore, as we know from Chapter 9, data flow, unless well planned 
can be the source of several procedural biases which only add un¬ 
necessarily to the risks of relying on the results of a study. 

10.2 Methods of Data Collection 

Because we are concerned with the flow of data, it is appropriate 
to begin our studies with some discussion of methods of obtaining 
pertinent information from the elementary units under investiga- 

We may first note that in some studies, the method of data col¬ 
lection is simple and straightforward-observations can be made 
easily by direct observation. For example, to determine whether or 
not a glass jar is broken, we ordinarily need only to look at it; while 
to verify the diameter of a bearing, we need only to measure it. 

In other problems, the method of data collection may be a far 
more complex procedure. For example, it may require a lengthy 
laboratory analysis or an involved physical test of materials. It may 
necessitate the use of complex equipment—for example, X-ray or 
fluoroscope inspection often is used to determine whether or not 
naws exist m aluminum castings. 

As our introductory comments indicate, the exact procedure to be 
used to collect data from elementary units in any study depends on 
the nature of he observations, and, moreover, on the nature of the 
problem. Furthermore, it must be pointed out that the choice of a 
method is almost exclusively the responsibility of an expert in the 
subject matter; e.g., the engineer, the chemist, or the psychologist 
It is not obviously, a statistical problem, per se. Yet it is an impor¬ 
tant problem and must be given serious consideration in almost 
every statistical study. 

A discussion of most methods of data collection would go beyond 
the scope of this textbook. However, in the remaining paragraphs 
o t is section we shall consider various methods of data collection 
which are of importance m statistical studies dealing with human 
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populations and, hence, which arise very frequently. The possibility 
of inaccurate observations in such studies is great, as we have seen 
in Chapter 9. What, then, are the ways of collecting observations 
from people so that the risks of biased responses are minimized. 

Among the methods of data collection often used m surveys 
of human populations, the following are most important : (1) per¬ 
sonal interview; (2) mail questionnaire; (3) telephone interview; 
(4) panel technique; and (5) recording devices. There is a vast lit¬ 
erature on the use of each of these methods. Some of the literature is 
listed in the last section of this chapter. Here we can only discuss 

some of the characteristics of each method. 

There is no one best method for obtaining information. The 
method used will depend on such factors as the nature of the pro - 
lem the population, the time limit, and the available funds. Ve y 
often two of the methods can be combined. In any event, great care 
must be used in eliciting information by means of these methods, 
for improper techniques give rise to biases and, hence, affect the ac- 

curacy of a study. . 

1 Personal Interview. In the personal interview, questions are 

prepared in advance and are printed on a schedule on which the an¬ 
swers also are recorded. This method results m a relatively high 
percentage of usable returns from persons approached Personal 
contact enables the interviewer to clarify any misunderstanding o 
the questions. Sometimes he may be able to make observations to 
check on the accuracy of answers. Thus, a housewife might claim 
preference for one brand of coffee while another brand is visibly m 
use Sensitive questions can be interspersed strategically, and the 
respondent’s time can be utilized more effectively. By allowing more 
time for thought, recall can be facilitated. . 

The feature of personal contact, which makes the personal inter¬ 
view so desirable, is also a cause of its disadvantages. An inter¬ 
viewer must be careful not to allow his personal judgments to in¬ 
fluence the respondent. The inflection of his voice m reading a ques¬ 
tion a move of an eyebrow, a smile, or changing the wording of the 
question might exert an influence. Careful training and supervision 
of interviewers, therefore, is important. This requires a complex or¬ 
ganization. Such factors tend to make the personal interview a 
costly method. However, in considering cost, we must remember 
that the amount of nonresponse is much less than under other 
methods. Hence, fewer call-backs will be necessary. Thus when we 
consider costs in terms of usable information obtained, the costs of 
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personal interviews might not loom so large in comparison with 
other methods. 

2. Mail Questionnaire. The mail questionnaire is the most 
widely used method of data collection in survey work. It consists of 
a series of questions sent to the respondent and returned by him by 
mail. A mail questionnaire is of relatively low cost when considered 
m terms of questionnaires sent out. No staff training of interviewers 
is necessary, and a standardized set of questions can be prepared. 
Mail questionnaires avoid the possibility of interviewer influence 
upon the respondent. In the privacy of his own home, frank an¬ 
swers can be written at the respondent’s own convenience. A short¬ 
coming of the mail questionnaire is the high proportion of non¬ 
response. You are aware of this because no doubt you too often have 
filed such questionnaires in the waste basket. This weakness may 
make the costs of mail questionnaires relatively high when con¬ 
sidered in terms of usable completed questionnaires. Returns of 
questionnaires from the general public range from 10 to 20 per 
cent. Returns to special groups or to governmental agencies range 
lgher. Thus, the 1954 Census of Manufactures was conducted 
through a mailed questionnaire, with more than a 90 per cent re¬ 
turn. However, no matter what the rate of return, the failure to in¬ 
vestigate nonresponse may result in a serious bias in the results A 
mail questionnaire can be supplemented by follow-up letters, and 
then by interviews, to obtain answers from a properly drawn sample 
of nonrespondents. The combination of mail questionnaire and per¬ 
sonal interview is about the best and most economical combination 
It results m relatively low costs per unit of usable data obtained if 
the follow-up does not have to be extensive. At the same time the 
problem of nonresponse is alleviated through the personal inter- 

TT*I rvTTTfi 


Example 10.1 The use of a combined mail questionnaire and 
interview was reported recently in Applied Statistics, a British sta¬ 
tistical publication. The Ministry of Agriculture and Fisheries de¬ 
sired to determine the number of poultry and pigs kept in gardens 
and other nonagricultural holdings. No statistics had been gathered 

ww f ,m d ^ at t 10nIng had been removed in August, 1953. The Ministry 
wanted to determine the impact of the discontinuance of rationing on 

rn T 3 \[ aISed I T agricultura l holdings to see whether action to 
control the growth of such holdings was necessary. A letter accom- 
panied the questionnaire and was followed up by two reminder let¬ 
ters. The nonrespondents were then interviewed. In rural areas the 
response by mail was 87.9 per cent of the sample. Supplementary in- 
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terviewing raised this total to 99.5 per cent. In urban areas the corre¬ 
sponding figures were 94.1 per cent and 99.8 per cent. 

3. Telephone Interview. Costs of telephone interviews are rela¬ 
tively low, and a wide area can be covered without adding greatly 
to the expense. It is easy to train and supervise interviewers and to 
develop a standardized approach. Furthermore, it has been found 
that the refusal rate is low—usually not more than 2 to 3 per cent 
Its main applicability is to radio and television audience studies. A 
person is asked the name of the program that he is listening to and 
usually some questions about the product that is being advertised. 

Example 10.2 The method of telephone interviews for measuring 
radio (and also television) audience appeal was developed largely 
by C. E. Hooper, Inc. 2 The “Hooperating” was used as a guide m 
making decisions by both buyers and sellers of air time. The ques¬ 
tions asked of radio listeners were brief and only five m number. 

1. Were you listening to the radio just now? 

2. To what program were you listening, please? 

3. Over what station is that program coming in? 

4. What advertiser puts on that program? 

5. Please tell me how many men, women, and children were listen¬ 
ing to the radio when the telephone rang? 

The telephone interview can be effective only when the questions 
are few and not many details are required. It is difficult to detect 
misinformation. Most serious of all, however, is the fact that even 
though refusals are few, there are many no-answers, busy signals, 
and wrong numbers. At times these amount to one-third of the 
calls made. This creates a large amount of nonresponse, especially 
if no call-backs are made. The use of telephone interviews also limits 
the frame since nontelephone owners are excluded from being 
chosen for the sample. Care must, therefore, be taken to ascertain 
whether telephone owners provide an adequate population for the 
problem in question. 

4. Panel Technique. This method involves the selection of a 
group of respondents who are to be surveyed from time to time. 
Thus, we might have a consumer panel consisting of a few thousand 
women who are asked to report periodically on their reactions to 
forms of advertising, or to preferences for new products. The data 
are used to guide decisions as to advertising media and product de- 

1 Percy G. Gray, “A Sample Survey with Both a Postal and an Interview Stage, 
Applied Statistics, Vol. VI (1957), pp. 139-53. 

2 Matthew N. Chappell and C. E. Hooper, Radio Audience Measurement (New 
York: Stephen Daye Press, 1944), p. 62. 
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sign. The method also is used by the federal government to deter¬ 
mine consumer behavior patterns. In this latter instance panelists 
are required to keep records of their purchases. The information 
derived m this manner is used to determine the relative importance 
of the various components of the Consumer Price Index (food rent 
furniture, clothing, etc.). 

The use of panels facilitates the study of changing attitudes of 
consumers. If different samples are chosen at different times, it is 
difficult to assess whether any changes reflect a shift in attitude or 
result from the use of different samples. The usefulness of a panel 
requires that it be kept intact. This has been one of the great diffi¬ 
culties with the panel technique. Panelists change their residences, 
keep faulty records, change their minds in midstream, or just get 
“bored with the whole idea.” Another serious shortcoming of the 
panel technique is the psychological effect that being a participant 
has upon many panelists. It has been claimed that panelists fre¬ 
quently develop a critical “set”—or attitude—to questions. Panel¬ 
ists, having stated an opinion, often try to stick to it. They may thus 
be less subject to changes of opinion than the general public as a 
whole. 

5. Automatic Recording Devices. In keeping with an era of 
automation, various devices have been perfected to record observa¬ 
tions automatically. The photoelectric cell has been used to record 
the number of cars using municipal parking lot facilities at various 
times during the day. Such information is useful in making decisions 
on fees and maximum allowable parking time. It also can be used to 
determine shopping patterns in stores during the course of the day 
Other interesting devices have been perfected for radio and televi- 
sion audience surveys. 

( ( a 10.3 The A. C. Nielson Company utilizes a device called 
an Audimeter that is attached to a set. The Audimeter records once 
every minute to show whether the set is on or off, as well as the sta¬ 
tion to which the set is tuned when it is on. The Audimeter record 
tape is reviewed every month. 

Example 10.4 The Lazarsfeld-Stanton program analyzer is a 
device for determining what features of different programs appeal to 
radio and television audiences. The listener records his reaction bv 
pushing a green button to signify “like,” a red button to signify “dis¬ 
like, and by not pushing either button to register “indifference.” A 
permanent record of these reactions is kept. An interview usually 
accompanies the use of the analyzer to determine the reasons for the 
recorded reactions. 
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The advantages of these contrivances are that they do not de¬ 
pend upon memory nor are responses influenced by interviewers. 
Data can be obtained over an extended period of time without im¬ 
posing upon the respondent. On the other hand, these devices are 
rather costly, and the size of the sample is limited. It is difficult to 
obtain a probability sample of listeners, as all families selected by 
a random process might not be willing to permit the installation 
of the devices. Merely to record the opinions of families that do 
permit installation may result in biased information. Furthermore, 
the devices merely record that the set was tuned in on a given sta¬ 
tion. It does not follow that people are listening when a set is in 

use. 

10.3 Organization of the Collection Procedure 

Whether observations are to be collected by direct observation, 
by personal interviews, by mail questionnaires, by telephone 
queries, by some mechanical device, or by some other method, the 
collection procedure must be well organized. Of course, the exact 
nature of the collection procedure must be tied to the method of 
data collection to be utilized and to the special problem at hand. In 
this section we shall indicate only some of the more general require¬ 
ments of an efficient system for collecting observations. 

First of all, it must be noted that the proper records and forms are 
of importance. Thus, in an industrial inspection problem, a written 
record of the number of defectives and the number of nondefectives, 
or of the measurements made, must be maintained for any accept¬ 
ance sampling plan to be effective. To rely on an inspector’s 
memory in place of a written record would be foolhardy indeed. 
Similarly, in an inventory count, the proper forms will facilitate the 
collection of accurate observations. 

For statistical studies of human populations, for which the per¬ 
sonal or telephone interview method of data collection is used, there 
must be a suitable schedule. For other studies, such as those in which 
persons are contacted by mail, there must be a suitable question¬ 
naire. (Strictly speaking, a questionnaire is a form distributed 
through the mails or given to respondents to be filled out without 
any assistance; a schedule is a form filled out by an interviewer or, 
at least, in his presence. Often the terms are used interchangeably.) 
The collection of data and subsequent processing of these data can 
be facilitated by the proper schedule or questionnaire. Therefore, 
several considerations in their design must be borne in mind: 
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1. The physical appearance of the schedule or questionnaire may affect 
the amount and the accuracy of response. 

2. Questions should be as concise as is possible; moreover, the entire 
schedule or questionnaire should be as concise as possible. 

3. Questions must be worded clearly; they must be appropriate for the 
type of respondents expected to participate. 

4. Only questions which bear on pertinent information should be in¬ 
cluded; they should be formulated so as to yield this information. 

5. Usually questions should not be included if their answers can be 
obtained more accurately and more easily from other sources. 

6. Always pretest the schedule or the questionnaire. 

As the preceding remarks indicate, the choice of the proper forms 
and their design is important. These matters are often problems 
for specialists; almost always, they are problems for experts in 
subject matter rather than for the statistician. 

Another important factor in the organization of a collection pro¬ 
cedure is the hiring, training, and supervision of personnel—e.g., 
interviewers or inspectors. A well-organized collection procedure 
must be so designed that work loads are evenly distributed and 
reasonably scheduled. It also should include provision for (sample) 
checks on the actual performance of personnel. 

Even though the population and the purpose of a study are most 
carefully delineated, the questionnaire in perfect form, and the 
sample design most ingenious, the accuracy of a study will be 
greatly impaired unless the collection procedure is well organized. 

Example 10.5 Consider the experience with the census of indus¬ 
trial and commercial enterprises conducted in France in March, 1946. 

A cursory examination of the census questionnaires seemed to indi¬ 
cate that many firms had not answered the questionnaires. A sample 
survey was, therefore, conducted in urban areas of France to discover 
the degree of under-reporting that had taken place. From the sample 
study it was estimated that about 25 per cent of commercial and in¬ 
dustrial enterprises were overlooked. Furthermore, it was found that 
firms of all sizes were missed. It was concluded that the high propor¬ 
tion of enterprises overlooked in the census resulted from a disorgan¬ 
ized collection procedure. The result: The French government never 
published the results of the 1946 census of business. 

This illustration emphasizes a fact that we have mentioned ear¬ 
lier. Since a sample study is conducted on a smaller scale than a cen¬ 
sus, it is often easier to secure greater accuracy from sample studies. 
This is in part attributable to the fact that a collection procedure 
can be better organized and controlled more effectively in sample 
studies. 
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We cannot dwell at length on all of the many aspects of the or¬ 
ganization of collection procedures since many technical details are 
involved. You should be able to appreciate the complexity of this 
problem for many statistical studies by seeing what steps and pre¬ 
cautions a research agency actually took in its conduct of a nation¬ 
wide survey. 

Example 10.6 A few years ago a statistical study was conducted 
by Alfred Politz Research, Inc., for Life magazine to measure people’s 
interests in movies, magazines, radio, and television. The report on 
the survey is entitled: A Study of Four Media—Their Accumulative 
and Repeat Audiences. The design of the study began in the fall of 
1951 and took more than 1 year to complete. Actual interviews began 
in February, 1952, and it took 2% years to complete the entire sur¬ 
vey. This period of time was required because the scope of the study 
necessitated a very carefully planned collection procedure. Among 
the steps taken to achieve this result were the following: 

After the questionnaire had been prepared, 1,200 interviews were 
conducted in a pilot study in the New York metropolitan area. The 
pilot study had as its aim to test each factor which might affect the 
accuracy of the study. 

After the pilot study, a training program for interviewers began. 
Thirty-five instructors were used to train interviewers. Each inter¬ 
viewer was trained personally at training centers set up throughout 
the United States. The trainees had the purpose of the survey care¬ 
fully explained to them and were briefed on the subject matter of 
the questionnaire. Training in the art of asking questions also was 
provided. Instructors demonstrated the entire interview. Trainees 
went through the questionnaire with new respondents under observa¬ 
tions of the instructor, and appropriate suggestions and criticisms 
were offered by the instructor. 

As a final step in the training program, an actual survey situation 
was simulated. Test assignments were sent out to all interviewers. 
The interviewers w T ent to the designated areas (none of which were 
used in the final survey) to locate households, to select specific indi¬ 
viduals, and, when necessary, to make call-backs just as in an actual 
study. The field test was conducted under the direction of the home 
office which was to co-ordinate the actual survey. The completed 
questionnaires were sent to the home office where they were checked 
and critical comments made. Copies of the comments were sent to the 
interviewer and his instructor for discussion. 

In all, 207 interviewers were employed. During the conduct of the 
survey checks were made on interviewers by telephone, personal vis¬ 
its, post cards, and letters. Over-all, 7,141 respondents were inter¬ 
viewed, 36,868 interviews made, and 99,052 visits required. 

Obviously, most sample studies are not of the same magnitude as 
the one in the previous illustration. This does not mean, however, 
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that careful planning of the collection procedure is any less impor¬ 
tant in small-scale' studies. 

10.4 Processing the Data 

After observations have been completed, they must be processed. 
Processing includes the editing of observations for completeness, 
consistency, and accuracy. After the data are edited, they are ready 
for coding or classification so that the results can be tabulated or 
summarized. 

We say that a return is incomplete if, for a given respondent, some 
pertinent observations are missing. An attempt should be made to 
complete such returns. Failure to obtain such observations may re¬ 
sult in a serious bias of nonresponse. 

Observations also must be edited to determine their quality_ 

to detect, where possible, biased observations. In this connection 
there are several rules to follow. First of all, evidence of the quality 
of observations often is indicated in the data themselves. 

Example 10.7 In a survey of family expenditures, a family 
reports the ownership of a home. However, no expenditure is reported 
for property taxes. This seems inconsistent, and the matter should be 
investigated. 

This example illustrates a useful idea for many statistical studies. 
It often is desirable to make observations which verify other obser¬ 
vations. Such verifications will not necessarily provide conclusive 
evidence of poor quality. However, if inconsistencies appear, they 
should arouse suspicion, and, moreover, the observations should be 
rechecked. 

Example 10.8 A trade association conducts an annual survey of 
financial and operating ratios for its industry. Among the ratios re¬ 
ported by member firms are the following: net sales/inventory in¬ 
ventory/net working capital, and net sales/net working capital. 
You will observe that the product of the first two ratios is equal to 
the third. In this manner the consistency of the report can be checked 
for these items. Which ratio or ratios are in error cannot be deter¬ 
mined by the check, but questionable reports can be returned for cor¬ 
rection. 

In addition to inconsistencies, a return may contain values that 
are unlikely. A skilled editor can often recognize the causes of such 
unlikely values and take some action to correct them. 

Example 10.9 In a study of financial ratios the per cent of net 
worth that is invested in plant and equipment is requested. In a given 
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instance the percentage might be reported as .208. This is a very un¬ 
likely figure for the industry. What may have happened is that the 
respondent forgot to multiply the ratio by 100 to convert the answer 
to a percentage. 

At times discovery of the cause of an extreme value is not. as 
simple as in the previous illustration. In such instances, a decision 
must be reached in the face of uncertainty as to whether to retain or 
discard such extreme items. 

Example 10.10 A review of the results of a time study on an as¬ 
sembling operation showed the following times (in minutes): 

1.14 1.07 1.21 1.18 1.60 

1.10 1.18 1.02 1.11 1.25 

The time-and-motion-study man, in editing these data, may be sus¬ 
picious of the recorded time of 1.60. A check reveals no clerical errors 
or apparent unusual circumstances during the time of the operation. 
However, this number still may be the result of an out-of-control 
condition, not a normal condition. As such it would not be part of 
the sample from the population of interest. This population is the 
set of times which would occur if the operation were repeated over 
and over again under stable conditions. 

One solution to the problem is simply to discard the suspected ob¬ 
servation. This is a good solution only if two rules are followed. First 
of all, the decision to discard any observation should not be made 
after the data are collected—it should be made a priori by the for¬ 
mulation of a statistical decision rule. Otherwise, the sample data 
may influence not only the decision itself but the rule for making 
the decision, and the sample study may wind up showing what some¬ 
one wanted it to show. If so, there is no point in doing the study m 
the first place. Secondly, such an a priori decision rule must be ob¬ 
jective—that is, it must be based on a knowledge of the risks in¬ 
volved in the decision on whether or not to discard the extreme time. 
Thus, two errors may be recognized, namely: 

1. Discarding an observation when it should not be discarded be¬ 
cause it is not biased and merely large or small because of chance. 

2. Not discarding an observation when it should be discarded be¬ 
cause it is biased and is not extreme due to chance factors. 

The situation thus presents a testing problem and may be treated 
by methods similar to those discussed in Chapter 8. 

Unusual regularity in the responses may sometimes be observed m 
editing. This may indicate inaccuracies, and the returns should be 
reviewed. 

Example 10.11 In the 1900 Census of Population it was found 
that the number of Negroes listed as unable to speak English was 
suspiciously large. The matter was investigated, and it was found 
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«Stv t sche ^ ule - i ] n contiguous columns, contained questions on the 

c fl b «p ty f * n r r d ’ &hlh 7 t0 Wnt6 ’ and abi]i ty to speak English. In the 
ca^e of white respondents the answers usually were “yes yes yes ” 

The pattern of recording the first response three times was’carried 

English OfTT could neither read norite 

English Often after eliciting the “no” response to the first question 

on ability to read, careless interviewers would write “no, no, no” with¬ 
out asking the second and third questions. In many instances the re¬ 
plies would have been “no, no, yes.” 

While the “yes yes, yes” was a regularity in the recording of re¬ 
sponses that would not arouse suspicion, the recording of “no no no” 
regularly indicated a possible inaccuracy. The tabulations for’Ne¬ 
groes, therefore, had to be abandoned for that census. 3 

The afore-mentioned checks for accuracy of data are applicable 
no on y m editing but should be used to appraise numerical data 
wherever you may happen to encounter them. 

Example 10.12 A statistician made a survey 4 of the numerical 
data cited in the Kinsey report on males. On page 10 of the report it 

1221*4^ n p mber ° f individuals “solved in the study was 

nart! 'f 0 !^ 86 6 i ho T ever ’ ama P of United States shows the 
parts of the country from which the respondents came. The man 

contains 427 dots, and it is stated that each dot represents 50 people 
mi4 nd * that the StUdy in0luded 21 ’ 350 individuals rathe^ than 

After observations are edited, it may be necessary to code them 
Coding, is the classification of observations to facilitate their 
tabulation or summarization. The process of coding is particularly 
important for statistical studies in which there is the possibility of 
many different observations. These many different observations 
must be classified m order to have meaning. 

, j hU !i’ c ° nslder the use of a questionnaire. Questions generally can 

mt ° tW ° br ° ad gr0Ups: (1) fix ed-alternative questions 
and (2) tree-response or open-ended questions. ’ 

Example 10.13 Consider some questions that might appear in a 
marketing research study of consumers’ smoking habits. An illustra¬ 
tion of a fixed-alternative question is: 

Do you have a favorite brand of cigarettes? 


1. Yes 


2. No 


CD 

CD 


m 
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There may also be more than two alternatives as illustrated below: 

How many packs of cigarettes do you smoke in a day? 

Less than %_ 1— 1%— 2 more — 

Let's look at an open-ended question: 

What features of this cigarette advertisement appeal to you? 

There are no fixed choices of answers to this question. The respondent 
answers and the interviewer records the reason. 

Some questions combine both methods. 

Do you favor a flip-top cigarette box? 

1. Yes 

2. No 

3. No preference 

If your answer is “yes” or “no,” state your reasons. 

The first part of the question offers fixed alternatives, while the re¬ 
quest for reasons illustrates an open-ended question. 

The coding of fixed alternative questions is not always necessary 
if hand tabulation is to be used. The answers on the questionnaire 
may be used as a basis for sorting the responses. Thus, papers with 
“no” responses may be put on one pile and “yes” responses on a 
second pile. When replies are to be tabulated by machine, numbers 
or letters must be assigned to the different alternatives. The co e 
numbers may appear on the questionnaire as in the illustrations 
above Thus, in the question concerning preferences for a flip-top 
box, a 1 would represent “yes”; a 2, “no”; and a 3, “no preference.” 
At times the code numbers can be checked off on a specially pre¬ 
pared coding card. Such coding cards also could be used for hand 
tabulation since it is much easier to sort cards than question¬ 
naires. 

The coding of open-ended questions is a more complex operation. 
Categories must be determined so that each answer fits only one 
category and so that a category exists for each response. In coding 
open-ended questions it is customary to include a miscellaneous 
category. However, this category must not be too inclusive. Thus, a 
report from one survey on sources of local government revenue 
classified approximately 36 per cent of all tax receipts as miscel¬ 
laneous. Greater care in the design of appropriate categories would 
have reduced the magnitude of the miscellaneous group and would 
have offered additional pertinent information. Obviously, in all 
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coders^ 1 " 7 Pre<?1Se lnst,ructions and training must be provided for 


10.5 Tabulation of Data 

The tabulation of data involves the summarization of results in 
statistical tables. These tables may be employed for the purpose of 
computing statistics required by a decision rule, or for presenting 
numerical information in a final report. 

Plans for the tabulation of observations should be made at the 
time the questionnaire is designed. In fact, the design must take into 
consideration the ease of tabulating the responses. Such planning in¬ 
cludes determination of the information to be contained in a table 
its layout, and detailed instructions for tabulators. 

A form of table commonly used to summarize numerical data is 
the frequency distribution. We already have discussed absolute and 
relative frequency distributions. We now shall consider some meth¬ 
ods for the construction of such distributions. The tallying of the 
occurrences of various attributes or of variates can be performed 
fit er by h f; nd , or by machme - The greater the amount of data to 
be processed the greater is the advantage of machine tabulation. 
Machine methods will be considered in a later section. For the 
present, we illustrate an efficient procedure for the tallying of fre¬ 
quencies by hand. 

Example 10.14 The following 15 responses were received to the 
request for monthly rentals (in dollars) of dwelling units- 67 50 75 
74.60, 78, 69, 68.50, 73, 72, 77, 73, 7 4 ; 78, 67, Thftech- 

mque of cross-fives is a convenient method to count frequencies 
Four vertical strokes represent four frequencies and a diagonal line 
crossing them is the fifth, e.g.: s nal line 


$70-174.99. . y/Jj , , * 

$75-879.99 . TV/ ' 1 \ 

1 otal. —r 

. lo 

The construction of a frequency distribution involves the deter¬ 
mination of the best method of grouping the data into classes. Thus 
m Example 10.14, monthly rentals are divided into three classes. An 
important factor m the process of grouping data is the choice of 
class limits -the lowest and highest values of observations that 
may fall m any one class. Thus, in Example 10.14, the limits of the 
first class are $65.00 and $69.99. Class limits must be chosen so that 
they are mutually exclusive. That is, each item should fit into one 
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and only one class. Thus, the class limits shown below are not mu¬ 
tually exclusive, and should never be used. 

$1,500-82,000 

$2,000-12,500 

We could not determine whether a $2,000 item belongs in the first 
or second class. Another consideration is the class interval -the 
range in values for each class. The class interval is equal to the dif¬ 
ference between two consecutive upper limits (or between two 
consecutive lower limits). Thus, in Example 10.14 the class interval 
j g $5 00—$70 minus $65 or $74.99 minus $69.99. 

The choice of the classes, the class interval, and the class limits 
will depend on the purposes that the frequency distribution is to 

serve. 

Example 10.15 A buyer in a department store desires to know 
how many men’s dress shirts of each size he is to keep m stock A 
relative frequency distribution of men’s shirt sizes would enable him 
to make a decision. 

a ize Relative Frequency 


In this situation, the classes represent a single size. It would not be 

hel By Untrasrco^sider'a buyer who must determine the number 
of sport shirts of each size to keep m stock. Sport shirts come m the 
following sizes: small (14 and 14i/ 2 ), medium (15 and 15/ 2 ), arge 
(16 and 16%), and extra-large (17 and 17%). It therefore would be 
desirable to group the sizes as follows: 


Size 

14- 14J/2 

15 - 15 ^ 

16- 16^2 

17-17 H 


Relative Frequency 


While the basic criterion in setting up a frequency distribution is 
the purpose it will serve, it often is helpful to keep a few general 
suggestions in mind. To provide a basis for discussion of these sug¬ 
gestions, Example 10.16 presents a formal frequency distribution. 

Example 10.16 The following table presents some of the results 

of the 1953 and 1958 Surveys of Consumer 

nnnnoiiv nnr]fir the ausmces of the Federal Keserve 
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PRICES PAID BY PURCHASERS OF NEW AUTOMOBILES 
^IN THE UNITED STATES: 1952 AND 1957 


Price 

(in Dollars) 


Percentage of Purchasers* 


Under 1,500. . 

1 

1,500-1,999_ 

2,000-2,499.... 

. 5 

1 K 

2,500-2,999. 

3,000-3,499. 

3,500 and over.... 

. 25 

. 24 

. . 31 

Total. 

. 1001 


1952* SamPle ° f 250 Spending units fOT W67 and 245 spending units for 

t No cases reported or less than one half of 1 per cent. 
t Does not add to 100 since the nonresponse was 2 per cent in 1952 

NOn Scc nS BTsed 5 I W S S 1 Cent ' Data add to 100 because of mounding.' 

Source. Based on Supplementary Table 3 in “1956 Survev of Pnr 

Vsi , r b n Houains -'' ^sZesllfeZ', 

August, 1955, p, 817; and Supplementary Table 6 in "1958 Survev of 

S, n "r 8 T7n Purohase of Durable ****£ZZ£ 

A frequency distribution should not have too many classes. One of 
the reasons for grouping data is to reveal the pattern of the distri¬ 
bution. The use of too many classes tends to obscure this pattern 

in i! r lT?n’,;r W r t0 . USe 1100insteadof K0 ° class “tervals 

folbws Pl 10 ’ 16 ’ the ° laSS S1,50a ~ 81 ’ 999 would be broken down as 

$ 1 , 500 - 81,599 
$ 1 , 600 - 11,699 
$ 1 , 700 -$ 1,799 
$ 1 , 800 -$ 1,899 
$ 1 , 900 -$ 1,999 

We wouid have only 5 per cent to distribute among these five groups 
m 1952 and m 1957. This might mean that one or more of thes! 
groups would contain 0 per cent. In general, a few items would be 
spread over many groups, and no pattern would be revealed 
In many cases we must make use of unequal class intervals in 
order to summarize data adequately. This is especially true for very 

st aTl rJ?f rektive fre< 3 uen cies in the tail are 

l^smaU as «1 mn f i U • famil 7 income > we can use an interval 
as small as 11,000 for lower incomes because there are many families 

m such groups. As incomes increase it is necessary to widen the in¬ 
terval to $2,500, $5,000, $10,000, and so on. 

At times open-end classes must be employed in order to include a 
ew extremely large or small values without adding excessively to 
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the number of classes. Such classes either have no upper limit or no 
lower limit. Thus, in Example 10.16, the classes under 1,500 and 
“3 500 and over” are illustrations of open-end classes. 

In case of doubt, it is better to have too many classes m a first 
draft of a table than too few. Smaller classes can be combined into 
larger classes. However, larger classes cannot be broken down unless 
the data are retabulated. Thus, in Example 10.15, if we knew the 
proportion of men who wear small-size shirts, we could not deter¬ 
mine the proportion who wear size 14 or size 14/ 2 . However if we 
knew the proportions for each of the individual sizes, we could com¬ 
bine them into small, medium, large, and extra-large. 

A frequency distribution should not have too few classes. Too tew 
classes oversummarize the data and blur the pattern of distribution. 
The table based on the Survey of Consumer Finances oversumma¬ 
rizes the data for 1957. The “$3,500 and over” class could have been 
broken down into additional intervals to give us a better picture ot 
the distribution. However, this would have resulted in undersum¬ 
marization for the year 1952. To secure uniformity, and thus to 
facilitate comparisons between the two years, the same classes are 
used. This emphasizes the fact that the main consideration is al¬ 
ways the purpose of the table. , 

Class intervals should be chosen so that the mid-value of each 
class interval is approximately equal to the mean of the items in 
the interval. The mid-value of the class ordinarily may be taken 
as the average of the lower limits of two successive classes Thus, 
the mid-point of the class $1,500-11,999 is equal to $1,750—i.e., 

- (1 500 + 2,000). The concept of the mid-value as a class average 
fs essential if computations of statistics are to be based upon fre- 

auency distributions. ., . 

When included as part of a final report, the frequency distribution 
should be presented as a formal statistical table. As such it should be 
self-explanatory, clear, and attractive. Any format that accomp is es 
these ends is satisfactory. If you consult a book that contains sta¬ 
tistical tables such as the Statistical Abstract of the United States, 
you will find many acceptable formats. Let’s reconsider the table 
shown in Example 10.16 as a basis for discussing certain essential 
parts of a statistical table. 

First note that the table has a title. The title as a rule answers 
the questions: “What?,” “Where?,” and “When?” Each column 
has a column heading which consists of the characteristic (e.g„ 
price) and the unit of measurement (e.g., dollars). The first column 
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is called the stub and lists the classes. Footnotes are directly below 
the table, and the source follows the footnotes. Footnotes are 
needed to explain characteristics of a table that are not self-evi¬ 
dent but that are essential to an understanding of the data. The 
source is the only way the reader can trace the figures to their origin 
for checking or for obtaining supplementary information. The ab¬ 
sence of a source should arouse suspicion about the data in a table. 

The table has horizontally ruled lines following the title, after the 
column headings, between the total figures and the body of the table, 
and at the close of the table. Vertically ruled lines are used to sep¬ 
arate the columns. This method of ruling lines is not mandatory. 
It has, however, been found that such lines enhance the appear¬ 
ance of the table. They also make for greater clarity and ease of 
reading. 

10.6 Computation of Statistics: The Mean 

After the data are edited, coded, and tabulated, pertinent sta¬ 
tistics can be calculated. When the sample size is large, it is possible 
to save time and expense by making these calculations from a fre¬ 
quency distribution. Such calculations are useful especially in the 
determination of the sample mean and the sample standard devi¬ 
ation. The mean and the standard deviation, as you know, are com¬ 
monly used criteria for decision. In addition, the sample standard 
deviation may be used to estimate the population standard devi¬ 
ation which, in turn, may be used to estimate the standard error of 
the mean. 

The methods of computation that we shall consider are most use¬ 
ful when calculations are performed by hand or with a desk calcu¬ 
lator, or when secondary data are available only in the form of a 
frequency distribution. When tabulating machines or electronic 
computors are available, it is simpler to perform calculations from 
the raw data without grouping them in a frequency distribution. 

In this section we shall consider the computation of the arithmetic 
mean from a frequency distribution. We assume in the computa¬ 
tions that the mean of the items in a class is equal to the value of the 
mid-point of that class. Thus, in Example 10.17, we assume that 
the mean of the 6 items in the class $1.50 to $1.59 is equal to $1.55, 
the mid-value of that class. A similar assumption is made for the 
other classes in the distribution. If this assumption were true, then 
the mean of the data grouped into a frequency distribution would be 
exactly equal to the mean of the ungrouped data. However, this as- 
sumption rarely is fulfilled in practice. Nevertheless, if the class 
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limits are selected carefully, and if the class intervals are not very 
wide, the error resulting from this assumption will be relatively 
small. 

Example 1017 A trade association conducts a wage survey. It 
gathers information on the guaranteed minimum wage of inspectors. 
The work sheet for the computation of the average minimum wage is 
shown below: 


WORK SHEET FOR COMPUTATION OF MEAN 


(1) 

Wage 
(in Dollars) 

(2) 

Mid-value 
of Class 
m 

(3) 

Number of Workers 
/ 

(4) 

Total Value 
of Class 
fm 

1 50-1 59 . 

1.55 

6 

9.30 

1 60-1 69 . 

1.65 

11 

18.15 

1 70-1 79 . 

1.75 

26 

45.50 

1 80-1 89 . 

1.85 

40 

74.00 

1 90-1 99 . 

1.95 

32 

62.40 

2 00 2 09 . 

2.05 

21 

43.05 

2 10 2 19 . 

2.15 

12 

25.80 

2 20 2 29 . 

2.25 

8 

18.00 

2 30 2 39 . 

2.35 

4 

9.40 





Total. 


160 

305.60 


__ 2/m __ 305.60 
n 160 


$1.91 


The steps in the computation of the arithmetic mean are in- 
dicated in Example 10.17. The first column of the work sheet shows 
the classes; the second column, the mid-value of each class; and 
the third column, the frequency in each class. In order to compute 
the mean it is necessary to determine the sum of all the items. To ob¬ 
tain the sum of the items in each class, we multiply the frequency 
by the mid-value of the class. Thus, to obtain the sum of the items 
in the first class, 1.50 to 1.59, we multiply the frequency, 6, by its 
mid-value, 1.55. The sum of the items for each class is entered into 
the fourth column—the frequency times the mid-value, fm. The 
total of this column is the sum of all the items. If we divide this sum 
by the number of items, we obtain the arithmetic mean. In symbols, 
we write 


n 

The computation of the arithmetic mean may be simplified fur¬ 
ther by coding the data. To code data we add, subtract, multiply, or 
divide each data item by a constant. In this manner, it is possible to 
reduce the magnitude of the data or otherwise simplify them and 
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thereby facilitate computations. In Example 10.18, we have illus¬ 
trated the method of computing the mean from coded data. The 
first step in coding to compute the mean is to subtract a constant 
from each of the mid-values. If each mid-value is reduced by a con¬ 
stant amount, the mean will be reduced by the same amount. This 
simplifies the computation but gives us a coded mean. We can djecode 
the data by adding back the same constant. The constant is generally 
a mid-value near the center of the distribution. Thus, in Example 
10.18,1.95 has been subtracted from each mid-value. The results are 
shown in column (4). 

You will observe that each of the coded mid-values is an exact 
multiple of the class interval. This always will be true when all class 
intervals are equal. It is, therefore, possible to introduce an addi¬ 
tional simplification by dividing the coded mid-values by the class 
interval. Thus, in Example 10.18 the coded mid-values are divided 
by .10. The two steps of the coding operation can be summarized 
symbolically as follows: 


m ~ 1.95 


where d represents the final coded value. The values of d appear in 
column (5). Thus, for the first class 


We now find the mean of the coded data. To do so it is first 
necessary to multiply each coded mid-value (d) by its frequency 
(/) and divide the sum of the fd values by n. This gives us the 

coded mean, namely, — To decode the mean we must reverse 

our coding operations. We, therefore, multiply the coded mean by 

the class interval (.10) to counteract the earlier division. This 
• 6 4 

gives - We then add 1.95 to this result to counteract the effect 
of the earlier subtraction. Thus, the value of the mean is 


= 1.95 - .04 
= $1.91 . 

This is exactly the same as the result obtained by direct computation 
from the uncoded values in Example 10.17. 

Example 10,18 The mean of the wage data in Example 10.17 is 
calculated from coded observations in the work sheet that follows: 
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WORK SHEET FOR CALCULATION OF MEAN FROM CODED DATA 


(1) 

Wage 

(in Dollars) 

(2) 

Mid-value 
of Class 
m 

(3) 

Number of 
Workers 
/ 

(4) 

m — 1.95 

.(5) 

Coded Mid¬ 
values 
d 

(6) 

fd 

1.50-1.59. 

1.55 

6 

-.40 

-4 

-24 

1.60-1.69. 

1.65 

11 

-.30 

-3 

— 33 

1.70-1.79. 

1.75 

26 

-.20 

-2 

— 52 

1.80-1.89. 

1.85 

40 

-.10 

-1 

— 40 

0 

1.90-1.99. 

1.95 

32 

0 

0 

2.00-2.09. 

2.05 

21 

+.10 ! 

+ 1 

21 

2.10-2.19. 

2.15 

12 

+.20 

+2 

24 

2.20-2.29. 

2.25 

8 

+.30 

+3 

24 

2.30-2.39. 

2.35 

4 

+.40 

+4 

16 

Total.... 


160 



-64 


x = 1.95 + — X (Class Interval) 

n 

= 1.95 + X (.10) 

= 1.95 - .04 

= 11.91 

To calculate the mean from a frequency distribution, the mid¬ 
value of each class must be known. The mid-value of an open-ended 
class, however, cannot be determined. Thus, it is not possible to com¬ 
pute the average price paid for automobiles by consumers from the 
data in Example 10.16. Hence, it is important that open-ended in¬ 
tervals be avoided if possible when the mean is needed as a basis for 
decision. The average value of the items in an open-ended interval 
should be given in a footnote if such an interval must be used. 

10.7 Computation of Statistics: The Standard Deviation 

In this section we turn to the calculation of the sample standard 
deviation. Heretofore, we have computed the sample standard de- 
viation from the formula 


, = ^LZ^r (i « 1, 2, . . . , n) . 

\ n 

This formula has the advantage of pointing up the meaning of the 
standard deviation, but it is not the easiest to use for purposes of 
computation. To simplify the computation of s, we shall make use 
of the fact that 

2(xi — %) 2 = 2 Xi 2 — (i — 1, 2, n) . 

The formula for the standard deviation, thus may be written as fol¬ 
lows: 
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I ZX ,: 2 - 




1 . 2 . 


n) 


IZxF /2xi\ 2 

n \ n ) (^* —• 1 , 2, } n) 

Note that with this method, it is not necessary to compute the de¬ 
viations about the mean. We need only the values of 2 x t and 5ay \ 

Example 10.19 Suppose that we compute the standard deviation 
of the following 5 observations: 5, 8, 10, 4, 3. The mean of these ob¬ 
servations is 6 We now compute s by the use of both methods. The 
following work sheet provides the basic information: 


Xi 

Xi — X 

(Xi - ir) 2 

^ 2 

5 

-1 

1 

25 

8 

2 

4 

64 

10 

4 

16 

100 

4 

-2 

4 

16 

3 

-3 

9 

9 

Total 30 

0 

34 

214 


The sample standard deviation, from its definition is 

-4 


jX( Xi — x) 2 
n 

134 
5 

« VKS 

= 2.61 . 

We now use the alternative method of computing s. 


-4 


-4? T Wy 

-4 


1 214 _ /30y 
5 W 
= V42.8 - 36 
= V6J3 
= 2.61 

Observe that both methods yield the same result. 

A convenient formula for calculating the standard deviation from 
observations grouped in a frequency distribution is 

/ s/m 2 /S/toV 
n \ n J 9 

where m again represents the mid-value of a class and / the fre- 
quency in a class. 


-4 
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We can code observations to simplify the computation of the 
standard deviation in the same manner as for the mean. In Example 
10.20, we calculate the standard deviation for the same set of data 
for which the mean was computed in Example 10.17. Because we 
ordinarily compute the standard deviation at the same time as the 
mean, the work sheet illustrates the columns needed to compute 
both statistics. 

You will recall that the first step in coding is to subtract a constant 
from each mid-value. This operation does not affect the value of 
the standard deviation because the standard deviation depends only 
on deviations from the mean. If we reduce each value by a constant, 
the mean is reduced by the same amount. Thus, the deviations from 
the mean remain unchanged, and, hence, so does the standard devi¬ 
ation. Because of this, it is unnecessary to compensate for the sub¬ 
traction of a constant from each observation in computing the 
standard deviation. 

However, when we divide the observations by a constant, the 
standard deviation is also divided by the same constant. Therefore, 
it is necessary to multiply the standard deviation calculated from the 
coded data by that constant in order to obtain the final result. 

In the calculations shown in Example 10.20, 1.85 was subtracted 
from each mid-value (instead of 1.95 as heretofore) in order to show 
that any mid-value may be subtracted. The results were then di¬ 
vided by the class interval, .10. The coded mid-values are shown in 
column (5). 

Example 10.20 The work sheet for the calculation of the mean 
and standard deviation for the wage data in Example 10.17 follows: 


WORK SHEET FOR THE CALCULATION OF MEAN AND 
STANDARD DEVIATION FROM CODED DATA 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

Mid-value of 

Number of 


Coded 



Wage 
(in Dollars) 

Class 

Workers 


Mid-values 

fd 


m 

f 

m — 1.85 

d 

1 50-1 59. 

1.55 

6 

-.30 

-3 

-18 

54 

1 60-1 69. 

1.65 

11 

-.20 

-2 

-22 

44 

26 

0 

1 70-1.79.1 

1.75 

26 

-.10 

-1 

[ -26 

1 80-1 89. 

1.85 

40 

0 

0 

0 

1 90-1 99. 

1.95 

32 

+.10 

+1 

32 

32 

2 00-2 09 . 

2.05 

21 

+.20 

+2 

42 

84 

2 10-2.19. 

2.15 

12 

+.30 

+3 

36 

108 

128 

100 

2 20-2.29. 

2.25 

8 

+ .40 

+4 

32 

2.30-2.39. 

2.35 

4 

+.50 

+5 

20 

Total. 


160 



96 

576 
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x — 1.85 H—X (Class Interval) 

rb 

-1.85 + ^XUO) 

= 1.85 + -06 
= $ 1.91 . 

The value of the standard deviation is 

X (Class Interval) 


(?/ 


y?_ 

- M 'Jm - ©' 

= .10V3.60 - .36 


= .10V3.24 
= $.18 . 

Notice, in particular, that it is necessary only to multiply by the 
class interval in order to decode the results and obtain s. 


10.8 Machine Tabulation and Calculation 

In surveys where the number of respondents is large, or where 
there are many observations to be tabulated, machine tabulation 
usually is more effective than hand tabulation. It is generally stated 
that there should be at least 5,000 returns to process before machine 
tabulation can pay its way. However, this is just a rule of thumb. 
Thus, if an organization has tabulating equipment and free time 
available, it might be more economical to use the tabulating equip¬ 
ment even with fewer returns. 

Methods of machine tabulation were first used to tabulate the 
census of population data more than 50 years ago. Since then, tre¬ 
mendous improvements have been made in the machines, but the 
basic ideas remain the same. Today, there are two kinds of equip¬ 
ment m common use: the Hollerith machines (International Busi¬ 
ness Machine Corporation equipment) and the Powers machines 
(Remington Rand Corporation equipment). We shall devote our at¬ 
tention to the International Business Machine (IBM) tabulating 
equipment. 

The first step in machine tabulation is to punch information on 
cards. Bits of information are represented by various combinations 
of holes m a card. Figure 10.1 presents an illustration of a punch 
card. The card has 80 columns and 12 positions in each column where 
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data can be recorded. A single hole punched in a column represents 
a number. Two holes punched in a column represents a letter of the 
alphabet. Therefore, all responses must be coded into numbers and 
letters in order to be punched into the cards. 

Figure T0.1 


0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 H 0 0 0 C a 0 I 
ii 11 m 111 i n i i 111 n i 11 11 11 ’ 

2 2 2 2 ill 11 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 : 

333333133333333333333333333 

mmiiuumiHtimmi 

5 5 5 5 5 5 b 5 5 5 5 5 5 5 5 5 5 5 5 5 5 3 5 5 5 5 5 
666666666666666666666666666 


Example 10.21 Let us assume that a rental survey covers 5,000 
respondents. Among the information collected are nature, of occu¬ 
pancy, number of rooms, and monthly rental (actual or imputed). 
Returns are to be tabulated by machine. The responses are coded as 

follows: . . 

a) Each respondent receives a number. Since there are 5,000 re¬ 
spondents columns 1—4 are used to record the respondent s number. A 
column or group of columns that records data for a specific question 
(like columns 1-4) is called a field. 

b) Since nature of occupancy has two possible responses (owner 
occupied, or rented), 1 column (column 5) will be sufficient. Column 
5 thus represents a field. The digit 1 is the code for rented, and the 
digit 2 for owner-occupied. 

c) The actual number of rooms is recorded m column 6. The 0 
punch may be used to indicate 10 or more rooms. (Alternately, 2 col¬ 
umns might be used—then, the actual number of rooms up to 100 

can be recorded.) , 

d) The actual monthly rental to the nearest dollar is recorded m 
3 columns (columns 7-9). Any rentals of $1,000 per month or over 
are not recorded but coded as “000.” (Under this procedure it is not 
possible to calculate the mean or the standard deviation of the rents 
because the actual rents above $1,000 are not punched on the cards. 
Any frequency distribution based on the punched data, therefore, 
would be open-ended.) By reserving a fourth column for rent we, of 
course, could record actual rents of more than $1,000. Therefore, re¬ 
lating the tabulation plans to the ultimate purpose of the study is of 
prime importance. 
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Columns 10-80 of the card may be used to record any other perti¬ 
nent observations collected. 

In Figuie 10.2 a card is punched in columns 1—9. The respondent’s 
number is 1234. He rents his dwelling unit, which has 5 rooms, and he 
pays a monthly rental of $135. 

Figure 10.2 


N T f i? Monthly 
Respondent = *§ R en *al 


0000000000000 

1 2 3 4 5 6 7 8 9 10 11 12 13, 

I 1 11 II II 11 1 1 I 1 
2122222222222 
3313333133333 ; 
4441444444444 ^ 
5555515515555 
6666666666666 
7777777777777 


9 9 9 9 9 9 9 9 9 9 9 9 9 
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Data are punched into cards by means of a key-punch machine. 
The cards feed automatically into the punching unit. The holes are 
punched by depressing typewriting keys for alphabetical informa¬ 
tion and a 10-digit bank of keys for numerical information. After the 
cards are punched, the machine stacks them automatically and they 
are ready for subsequent operations. 

The speed with w r hich data can be punched into the cards has been 
increased through a “mark sensing” process. The holes to be punched 
are marked with a special pencil. The cards are then put through a 
machine which punches holes wherever a mark appears. 

If cards are punched by hand, the cards may be checked for accu¬ 
racy by a verifier. The operator repunches the data on the verifier, 
and the machine stops whenever there is a disagreement between 
the data in the card and the information being repunched. Another 
method of verification is to have the operator punch a new card and 
compare the two cards by machine. (See collator , below.) 

The punched data may be printed on the top of the card by means 
of an interpreter . This machine, guided by the holes, prints the cor¬ 
responding numbers and letters. However, the newer key-punch ma¬ 
chines print the numbers and letters as they are being punched. 

The sorter is a machine which separates groups of cards, or which 
arranges cards in a desired order. The sorter drops the card into a 
pocket that corresponds to the hole punched in a given column. 
Thus, in Example 10.21, if we wished to separate the “owner-occu¬ 
pied” cards from the “rented” cards, we would sort for column 5. All 
the “rented” cards would fall in the “1” pocket, and the “owner- 
occupied” cards in the “2” pocket. A sorter counter may be attached 
to the sorter to count the number of cards in each pocket. Sorters 
work at a rate of 450 to 1,000 cards a minute. Thus, in about 5 to 12 
minutes we could determine the number of owner-occupied and 
rented dwelling units for 5,000 respondents. 

The sorter can also arrange observations in an array. For example, 
it could arrange rents in order from the lowest to the highest value. 
This would facilitate the construction of a frequency distribution of 
rents. 

The tabulator is a machine that can add, subtract, and print infor¬ 
mation. Tabulators can print about 100 to 150 lines a minute. They 
can add about 150 cards a minute if only totals are to be printed. 
Thus, for the rents in columns 7-9, the tabulator could add and re¬ 
cord the total rent paid for apartments with 1 room, 2 rooms, 3 rooms, 
and so on. It also could record the number of cards in each apart- 
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ment-size group. It then could print the grand totals for rents and 
the number of cards. All these operations could be performed in ap¬ 
proximately 40 minutes for 5,000 cards. 

The reproducer and summary punch is a machine that may be 
hooked up to the tabulator. It punches summary information into a 
card. Thus, if there were 500 3-room dwelling units with a total rent 
oi *28,000, the machine would punch this information into predesig¬ 
nated fields of a summary card. The reproducer automatically veri¬ 
fies that the information being punched is identical with the totals 
in the tabulating machine. 

The multiplier or calculator is a machine that can multiply or di¬ 
vide numbers punched into two fields. It also can add the numbers in 
a third field to a product or quotient. Furthermore, it can round off 
decimals to any place desired. The results are punched into a field 
that has been set aside for that purpose. In the rental survey illustra¬ 
tion, the cards punched on the reproducer-summary punch machine 
could be put through the multiplier to get an average rental for the 
dwelling units in each class interval, and for all the 5,000 dwelling 
units. Thus, the multiplier could divide *28,000 by 500 to obtain $56 
as the average rental for the 3-room dwelling units. 

Another useful machine is the collator. Let us assume that rents 
are arranged in array form on two sorters. The collator could merge 
the cards m both stacks so that the array form is maintained. Also 
if we wished to select, from the cards, all rented dwelling units with 
3 rooms and rentals over 100 dollars, the collator could perform this 
operation. It would then replace the cards in their original order 
Electronic computers which combine all the operations of the 
afore-mentioned tabulating machines have been developed in recent 
years. These computers also perform other useful operations, all at 
a tremendous rate of speed. Some computers, for example, can per¬ 
form as many as 35,000 additions in one second. The speed at which 
these computers operate has introduced a new era in expediting the 
flow of data, and in the solution of problems in all fields of endeavor. 
However, we still are unaware of all the potentialities of computers. 
Data must be “programmed” before they are processed by a com¬ 
puter. Programming is a plan for processing data. A program consists 
of a set of instructions that specify the operations a computer has to 
perform and the sequence of these operations. 

The electronic computer requires a longer time for the preparatory 
cycle m data processing than other methods. It is of utmost impor¬ 
tance, therefore, that the planning be done very carefully. To the 
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extent that planning is poor, or unexpected difficulties occur after 
processing begins, the computer very easily may magnify rather than 

reduce the problems of data flow. 

As in the case of machine tabulation, data must be prepared m a 
form that the computer can accommodate. There are three methods 
of data preparation: (1) punch card, (2) paper tape, and (3) mag¬ 
netic tape. To record information, holes are punched into cards, holes 
and/or spots are put on a paper tape, or magnetic spots are placed on 
a magnetic tape. These holes or spots represent numbers, letters, spe¬ 
cial symbols, or instructions. The magnetic tape economizes on 
space since as many as 534 impulses can be marked on an inch of 
tape. The cards are least economical as space savers. 

Cards or tapes are prepared with input preparation equipment. 
The data then are carried into an input reading mechanism. The in¬ 
formation is translated into electronic equivalents that are fed into 
the processing unit. The processing unit consists of a memory unit, 
a control unit, and an arithmetic and logic unit. 

Data and instructions enter the memory unit first. This is a de¬ 
vice for storing the information coming from the input unit, and 
holds data until they are needed for processing. After data are proc¬ 
essed, they may be returned to the memory unit for future use.. (In 
some computers it takes only 12 millionths of a second to find a given 
number, when needed, in the memory unit.) 

The control unit interprets the instructions conveyed by the im¬ 
pulses and directs the sequence of the operations to be followed. 

The arithmetic unit can perform the following operations: addi¬ 
tion, subtraction, multiplication, and division. The basic operation 
is addition. Subtraction is performed by the addition of comple¬ 
ments. Multiplication is nothing more than successive additions, and 
division consists of successive subtractions. 

The machine also is able to translate final results into decisions. 
The decision process consists of comparing numbers and indicating 
some course of action on the basis of the comparisons. 

Example 10.22 The U.S. Bureau of the Census is required by law 
to protect individual respondents to its surveys from the disclosure of 
their reported figures. In this connection, the Bureau uses the follow¬ 
ing decision rule: Information (such as manufacturers’ sales) is not 
published if the two largest firms account for more than half of the 
total in a category (e.g., more than half the sales of a product in the 

United States). . , 

Since the Census Bureau has many categories to which to apply 
this decision rule on disclosure, it recently has adapted the use of an 
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electronic computer to the problem. The machine is directed to add 
values for a category under consideration, to add the values of the 
wo largest firms, and compute the ratio of the latter tote fol er 
If the ratio is .50 or less, the machine prints a symbol indicating that 
no disclosure is being made. If the ratio is more than .50, the machine 
indicates the nature of the disclosure, the total for the two largest 
companies, and their identification numbers. S 

Heretofore, such checks on disclosure had to be made by clerks a 
far more time-consuming system. 6 J 

The results of the processing operations are reported by means of 
an output unit. The results flow in the form of impulses from the 
memory unit to the output unit where the results appear as holes in 
cards, spots and holes on paper tapes, or spots on magnetic tapes. 

ese are used to print the information. Some machines feed the 
impulses from the output unit directly to a printing device. 

. The'electronic computer uses a language that has only 2 characters 
m it no impulse which represents 0, and an “impulse” which rep¬ 
resents 1. Thus, it is necessary to convert all numbers, letters and in- 
structions into a binary system: a system which utilizes only a 0 

Example 10.23 We illustrate the use of the binary system for the 
digits 0 to 9 and for the letters in the abhahet- 7 y the 


Decimal Digit 

0 . 

1 . 

2 . 

3 . 

4 . 

5 . 

6 . 

7 . 

8 . 

9. 


Binary Form 
. .. 0000 
.. 0001 
.. 0010 
. . 0011 
.. 0100 
.. 0101 
.. 0110 
.. 0111 
. . 1000 
.. 1001 


Thus, 4 circuits are needed for each digit. The number 862 would 
be written as: 1000 0110 0010. 

For letters, 6 circuits are used. A is 01 0001, B is 01 0010, C is 01 
0011 and so on. Since there are 64 possible combinations of binary 
digits using 6 circuits, there are more than enough to take care of the 
26 letters of the alphabet. The remaining combinations are used to 
designate certain symbols and punctuation marks. 

. ha . Ve dlscusse 4 in an elementary way, some of the basic prin- 
ciples an d uses of tabulating machines and electronic computers The 
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proper use of these machines, as you can see, will facilitate the flow 
of information in statistical studies. Nevertheless, their use makes 
the problem of planning for data flow exacting. Plans must be 
clearly and carefully made if machines are to aid in the decision¬ 
making process. Remember that the machines themselves can do no 
more than they are directed to do. 

10.9 The Final Report 

The final step in a statistical study is to record the results m the 
form of a final report for future as well as for immediate reference. 
The most carefully planned and executed study is wasted unless the 
results are conveyed to the people responsible for carrying out and 
supervising the action indicated by the study. Furthermore, a record 
of the experiences of the study should be kept for reference in simi¬ 
lar studies which may be conducted subsequently. 

Although each problem situation calls for its own special type of 
report, and although each report presents its own special require¬ 
ment^ there are some general aspects of report writing that we may 
discuss. First of all, the final report should usually appear in two 
parts. The first part should summarize, in a nontechnical way, the 
conclusion—the statistical findings and the action indicated by these 
findings. The second part should be devoted to a more detailed ex¬ 
planation of the more technical aspects of a study, including a de¬ 
scription of the sampling procedure, the risks involved, and so forth. 

The nontechnical part of a report may assume many forms. It may 
vary from an oral communication or brief memorandum to a formal 
published document. As a rule, reports of studies conducted within 
an organization to solve purely internal problems are brief and in¬ 
formal. However, reports of studies conducted by outside consultants 
assume a more formal character. Similarly, in large organizations, 
methods of reporting may be highly formalized. The results of sec¬ 
ondary studies, especially those designed for wide use, generally are 
presented as formal publications. 

As a minimum requirement, even the most informal nontechnical 
part of a report should contain a statement of the problem and a 
summary of the findings. Thus, a report on an estimation problem 
should include the reason for the study and the resulting estimates. 
A report on a testing problem should contain a statement of the pur¬ 
pose of the study, the sample results, and the appropriate action in¬ 
dicated by these results. Thus, an inspection report on whether to 
accept or to reject a lot should contain the nature of the material in- 
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specteci, the sample size, how the sample was drawn, the number of 
f , ° 1VeS + n the 0t ’ and the action to be taken on the lot. To facili- 
sfgned eP ° * ° n SU ° repetitive studies > s P e “al forms may be de- 

The second part of a report should include additional more 
technical, information concerning a study. This part of a report also 

i^e/Zhinf r tUre ° f - he StatiStiCal StUdy and the P roblem 

itselh Such information is particularly important for secondary data 
which were not collected under the auspices of those using the resets 
for a decrsion. Here we shall list the type of information that should 

you should 111 6 reP ° r + t ° f any secondar y study. At the same time, 
ld ree °g nlze that it also is necessary to include each of 

thth “ a reP ° rt ° f the results of a Primary study 

( ich, after all also may at a later time and for another problem 
become a secondary study). p m 

maSTon ffie 55“^ ^ ° f & final rep ° rt should include re ‘ 

marks on the following eleven aspects of a study 

usually noS?H ^ ° f the 8tudy ' Secondar y studies are 

usually not directed at answers to a specific problem. Therefore the 

consumer of such studies should be informed on why the study was 

oTt'trm °T w “ ii k 

oi tne study may be utilized. 

si Jl h D : SmPh Z °{ the . population - A Precise description should 
give the geographica! region and the categories of material covered 
by the study. It should describe the frame that was utilized Care 

SThe study! 1 ^ there 18 n ° misunderstandin g ab out the coverage 

3. Nature of information collected. The report should show the 
information collected m sufficient detail. Information collected but 
not reported on should also be mentioned. Thus, hourly wages might 
be summarized as “$2.50 and over,” but if detailed figures of artua 
wages were collected, this point should be mentioned. If possible the 

sss z the d *“ ■» 

4. Method of collecting data. For example, a report should indi¬ 
cate whether personal interview, telephone interview, mail ques- 

’f° r a "° mbination of these methods was used, 

scribed The°^m J amP 9 ' ^ ° f samplin S should be de- 

. e sample size, as well as the arrangements for call-backs in 

case of nonresponse, should be given. Also the way in which the 
sample size was determined should be discussed 
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6. Decision rule. The decision rule should be given in full, to¬ 
gether with a description of the manner in which it was determined. 

7 Repetition. Is the study a one-shot operation, or is it one in a 
series of similar surveys? If the latter, then differences between this 
and earlier surveys should be pointed out. , 

8. Date and duration. This includes the period of time to which 

the data pertain. , , , 

9 Assessment. A statement should be made of the extent 

which the study actually fulfilled its intended purposes. 

10 Responsibility. The name of the organization sponsoring the 
study, as well as the names of the people conducting it, should be 
stated. This information often is a rough indicator of the quality of 

a n^Accwroci/. A general indication should be given of the degree 
of accuracy attained. If one is to base decisions on the results of a 
study, he must know the magnitude of the errors involved. The 
magnitude of the sampling error, as well as the possible sources of 
procedural biases or nonsampling errors, should be indicated. If 
possible, an estimate of the magnitude of the biases should be 

S1 Every bona fide study should contain information of the type de¬ 
scribed in this section. One should view with suspicion statistical 
studies which do not include such information m their reports and 
whose sponsors refuse to furnish such facts upon request. 

10.10 You Should Now Know That 

Data flow deals with the flow of information from the time that 
the sample of elementary units is chosen to the time that the obser- 
prated in final form to serve as guide, (or decston. 
There are many methods of data collection including direct ob¬ 
servation, the personal interview, the mail questionnaire, telep one 
questioning, and mechanical devices. 

q The choice of the method of data collection depends upon the na¬ 
ture of the problem, the population of study, costs, and time con- 

Sld The organization of the collection procedure is important in small- 

ip a nd in large-scale studies. , . 

Processing the data requires that they be edited to determine their 
completeness, consistency, and accuracy. It also requires the coding 

. , c *Ti_X_ InKlllo+lAri 
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The tabulation of data involves the summarization of results in 
statistical tables. 

The form of a frequency distribution is determined by the purpose 
it is to serve. 

It sometimes is possible to save time and expense by computing 
statistics from a frequency distribution. 

It may be more efficient to compute statistics from coded observa¬ 
tions than from original observations. 

In large-scale studies, machine tabulation is more economical than 
hand tabulation. 

Electronic computers, if utilized efficiently, can be of consider¬ 
able use in facilitating the flow of information in a statistical study. 

A final step in a statistical study is the recording of the results 
in a final report. 

10.11 You Should Now Be Able to Solve 

1. Critically and constructively discuss the following statement: 

All of the steps in data flow including collection of data, editing, 

coding, tabulating, and computing are essential parts of every sta¬ 
tistical study. Therefore, each of these steps requires a 100 per cent 
verification of the accuracy of the work done. 

2. Instructions often are mailed to interviewers participating in 
nation-wide market research studies. What are the possible short¬ 
comings of this procedure? 

3. For each of the following situations, discuss which method of 
data collection (personal interview, mail questionnaire, etc.) would 
be preferable. 

a) A study by the federal government to determine the value of retail 
sales each month. 

b) A survey by a trade association to determine the attitude of its 
members toward the publication of a trade journal. 

c) A study by the personnel department of a firm to determine atti¬ 
tudes of employees toward a contemplated plant cafeteria. 

d) A study to determine the attitude of physicians in New York City 
toward a recently marketed antibiotic. 

e) A study by a local real estate firm to determine the number of 
homes for sale in the area. 

4. The decision rule for an inspection procedure is as follows: Se¬ 
lect a simple random sample of 100 pieces. If 3 or less are defective, 
accept the lot; otherwise reject the lot. The number of defectives' 
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found in the samples selected from the first 50 lots inspected, was 
tabulated as follows by the inspector: 

Number of Defectives 

in Sample Number of Lots 

0. 3 

1 . 5 

2 . 15 

3 ...... 26 

4 . 0 

5 . 1 

6 . 1 

Total. 50 


Do these results appear suspicious? Explain. 

5. In reply to a questionnaire a company reports the following 
ratios for the year 1958: 

Cost of goods sold/sales.78 

Administrative expense/sales.08 

Selling expense/sales.15 

Net profit before taxes/sales.05 

Would you question the accuracy of this return? Explain. 

6. The XYZ Corporation has 1,018 accounts receivable. It selects 
a sample of 100 accounts to estimate the average value of its ac¬ 
counts receivable as at December 31, 1958. 

The 100 sample observations (in dollars and cents) are as follows: 


15.00 

5.60 

44.08 

81.65 

8.85 

88.37 

43.07 

63.29 

35.70 

36.19 

51.10 

25.30 

22.25 

45.50 

37.63 

46.24 

56.34 

44.80 

50.04 

55.00 

16.10 

12.80 

40.75 

13.90 

46.36 

40.15 

74.60 

59.05 

43.70 

34.83 

48.94 

56.43 

42.35 

64.80 

54.07 

53.71 

28.65 

67.80 

57.23 

42.97 

43.61 

41.25 

45.15 

43.38 

32.70 

54.60 

79.35 

10.50 

60.28 

37.39 


20.50 

25.75 

35.64 

36.87 

65.85 

17.08 

76.40 

45.50 

49.61 

41.85 

53.36 

47.84 

42.56 

23.50 

70.65 

47.42 

55.10 

52.09 

49.16 

39.70 

27.75 

48.29 

26.25 

52.81 

31.80 

51.30 

62.00 

50.60 

42.56 

42.63 

2.30 

33.35 

45.90 

65.10 

38.40 

66.30 

30.90 

34.15 

58.15 

29.50 

44.46 

68.90 

19.16 

69.70 

59.95 

85.50 

33.20 

58.85 

24.70 

46.10 


a) Summarize the data in a frequency distribution. Use a $10 class 
interval, with $0.00 to $9.99 as the first class. _ 

b) Present the frequency distribution as a formal statistical table ior 
use in a final report. 

7. Compute the mean and the standard deviation of the sample 
from the frequency distribution in Problem 6. 

What would be your estimate of the average value of accounts re¬ 
ceivable in the population? 

Compute the standard error of the estimate. 
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Estimate the total value of the population of accounts receivable 
and compute the standard error of this estimate. 

10.12 You Will Also Find That 


. , * followin &' references discuss many of the aspects of data flow 
including methods of data collection, organization of the collection 
procedure, and methods of processing and tabulating data, with par¬ 
ticular reference to studies of human populations: 

DE Tnh N< wT' E ™ ARDS T Some Theory of Sampling, pp. 1-23. New York- 
John Wiley & Sons, Inc., 1950. 

PA Npt N V M t LD w D ' SU I v Jt ys ’, Polls ’ and Samples: Practical Procedures 
New York: Harper & Brothers, 1950. 

Almost every statistics textbook discusses the construction of fre¬ 
quency distributions and methods of computing statistics from such 
distributions. For example, you might consult: 

Croxton, Frederick E, and Cowden, Dudley J. Applied General Stn 

iTwll 153 ~ 70 ’ m “ 82 ’ 215 - 22 ' M 6d ' NeW Ymk: 

Mi lls Fredermk C. Statistical Methods, pp. 40-64, 89-94 116-20 Id 
ed. New York: Henry Holt & Co., 1955 ’ ’ 2 °' 3d 

N 25 Z.fifi G ?i’i^ I s LL ^ M A J E J T ementar y Statistical Methods, pp. 224-50 
256-66, 311-18. Rev. ed. New York: Macmillan Co., 1956. 

A memorandum entitled “Recommendations concerning the Prep¬ 
aration of Reports on Statistical Surveys” was prepared by the 
United Nations Subcommission on Statistical Sampling in 1948 The 
memorandum is obtainable from the United Nations Statistical Of- 
nee and also is reprinted, in part, in: 

Yates, Frank Sampling Methods for Censuses and Surveys, pp 141-44 
New York: Hafner Publishing Co., 1949. 




CHAPTER • 11 


Continuing Decision Problems and the 

Control Chart 


11.1 Introduction 

In this chapter we shall consider the application of statistical 
theory in continuing decision problems. A continuing decision prob- 
lem, as the name implies, is a problem situation which requires al¬ 
most unceasing attention—one which calls for frequent decisions on 
whether or not to take a certain course of action. 

Example lid An investor usually faces a difficult problem when 
he decides whether or not to purchase a particular security. It is a 
problem that can be rationally resolved only after carefully investi¬ 
gating a company. . . . 

However, when an investor makes a decision to purchase some se¬ 
curity, he should recognize that he will face an even more exacting 
problem—a continuing decision problem on whether to retain or to 
sell the security. This is, obviously, a persistent and unceasing prob¬ 
lem which calls for almost continuous attention, at least until the se¬ 
curity is sold. As one large brokerage house has advertised: 

u . . . no investment decision can ever be a final one. Changed con¬ 
ditions constantly create changed investment opportunities and 
the investor must constantly appraise them . . . every day that you 
own and hold 50 shares of Typical Manufacturing, you say m effect: 
‘I’m satisfied it’s the best investment I can make of my money. 
(Merrill Lynch, Pierce, Fenner & Smith.) 

There are many business situations which call for continuing de¬ 
cisions. A commonly occurring problem requires a decision on a proc¬ 
ess or a system in one's company—a decision on whether to modify 
the process or to allow it to continue operating as is. 
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Example 11.2 Consider any manufacturing company which sets 
up a process to produce a product. It is important to design such a 
process so that it is capable of producing a relatively uniform eco¬ 
nomical, and useful product. Hence, the design, development, and in¬ 
troduction of the production process involve important decisions. 

However once a process is introduced, an equally important prob¬ 
lem arises It is caused by the recurring possibility that the quality 
level of the process is deteriorating. Hence, the following question 
must be answered at regular intervals: “Should we let the process 
continue to run as is, or should we examine and possibly modify it?” 


Example 11.3 A personnel administrator of a large company 
periodically considers a review of his subordinates’ personnel deci¬ 
sions (hiring, firing, promotions, demotions, ratings, etc.). The pur¬ 
pose of this review is to modify, if necessary, any trouble spots in 
personnel operations. Thus, the administrator faces a continuing de¬ 
cision problem on whether to let his department continue to operate 
as heretofore, or to review recent decisions as a basis for making 
some modification in departmental operations. 

. ^ er continuing decision problems on processes or systems arise 
m the inspection of clerical work, in maintaining a safety program, 
m maintaining inventories, in sales management, in controlling 
shortages, in records management, and so forth. Many of these 
problems are statistical, for ordinarily the wise choice of an alterna¬ 
tive course of action depends upon knowledge of some pertinent nu- 
mercial information. 

Continuing decision problems which are to be treated statistically 
obviously require not one sample study but a series of sample stud¬ 
ies. On the basis of each sample, a decision on whether or not to take 
action on a process is made. A particularly simple way of depicting 
a decision rule and the results of the series of samples for this type 
of problem is called a control chart. In this chapter we shall center 
our attention on the development of control charts for continuing 
decision problems. 

First, however, a few comments on the historical developments 
of these statistical techniques should be instructive. As early as 1924 
statistical methods were applied to the study of production processes 
by Dr. Walter A. Shewhart of the Bell Telephone Laboratories. In 
1931, Dr. Shewhart published his book. 1 on these methods, and this 
book set the pattern for the subsequent developments in the field. 
Priorto World War II most applications of quality control to pro- 

D.Va?Noftranr&Cannct Z 1931) Q ” a?/ °* UamSaCtured Pmd ^ <New York: 
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duction processes were found in a few plants in the electrical manu¬ 
facturing and textile industries and in some government arsenals. 
However, since that time there has been a rapid expansion m t e 
use of statistical quality control in all types of industries. Furt er- 
more the use of these same statistical methods has been extended 
from production process control to the control of various administra¬ 
tive and clerical operations. . 

In contrast, prior to the introduction of statistical techniques to 
check on production processes, a firm usually became aware of de¬ 
fects in its operations only after customers complained, or only after 
excessive waste, scrap, and rejects, or a sharp rise m rework time, or 
the loss of sales to competitors had occurred. In other words, ac¬ 
tion on the process was taken only after damage to the product and 
perhaps to the firm had taken place. Furthermore, when product 
deterioration was discovered, haphazard trial-and-error methods 
were used to take action on the process. The introduction of Shew- 
hart’s statistical methods replaced the haphazard procedure of the 
past with a scientific, objective, and simple and economical continu¬ 
ing decision procedure. It is now possible to detect deterioration m 
the process before the end product has become defective. 

11.2 Process Variability 

Many continuing decision problems, as noted, deal with the ques¬ 
tion of whether or not to take action on a process. Hence, they re¬ 
quire analytical studies, and the population under consideration 
should be regarded as infinite. In addition, the population of inter¬ 
est is studied not once but at regular intervals. 

To illustrate the nature of the problem, consider a firm which 
produces a film-coated wire. The aim of the firm is to keep the thick¬ 
ness of the coating uniform. Complete uniformity, however, is not 
possible for variability is a rule of nature. Hence, the population of 
thicknesses of the film coating must be characterized by some varia¬ 
bility. Thus, from hour to hour the thickness of film on the wires will 
vary simply because of the variability inherent m the process. If, in 
addition, the nature of the process itself changes m some way (that 
is the process shifts), there will be additional variation from hour to 
hour in the thickness of the coatings. Since some variability is to be 
expected, a question continuously arises as to whether or not any 
observed change in the quality characteristic may be disregarded as 
inherent in the process. The question must be answered because if an 
observed change is actually due to a significant process change, the 
process should be investigated. 
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To introduce the statistical methods used to analyze the causes 
of process variability let us consider a simple illustrative situation. 

Example 11.4 A friend of the authors is a professional golfer and 
keeps a record of his golf scores. Recently, he became displeased with 
his performances. He had a feeling that something had “gone wrong” 
with his game and thought his game should be analyzed by another 
professional golfer. He consulted the authors for an opinion. They 
grouped the golfer's scores for the last 125 rounds played (on his 
home course) into 25 groups of 5 consecutive rounds, and computed 
the mean of each group. The group means are plotted in Figure 11 . 1 . 

Figuie 11.1 is a control chart. The solid horizontal line represents 
the average score for the 125 rounds played. The two dotted lines that 
are labeled UCL (upper control limit) and LCL (lower control limit) 
are drawn at a distance of 3 standard errors of the mean from the av¬ 
erage score of the 125 rounds. 

An examination of the chart led the authors to conclude that an 
analysis of their friend's game might not result in any improvement. 
How was this conclusion reached? The answer to this question will be 
postponed. However, after reading the discussion that follows, the 
answer should be apparent to you. 


Figure 7 7.7 

CONTROL CHART FOR MEAN SCORE OF FIVE ROUNDS OF GOLF 



You will observe that the means of the sets of 5 rounds vary. We 
can consider golf playing as a process. The population consists of the 
infinite number of scores for 18 holes of golf that the golfer is capa¬ 
ble of producing. Each group of 5 scores is, therefore, a sample from 
such a process. 
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Let us now consider, in a general way, why the golf scores vary. 
The causes for this variability can be classified into two broad cate¬ 
gories: (a) common causes, and (b) special causes. Instead of the 
terminology “common-special” causes, statisticians also employ the 
terms “unassignable-assignable,” “uncontrollable-controllable, and 
“chance-significant” causes. We shall use these terms interchange¬ 
ably. 

Special causes of variability are those that bring about changes m 
the inherent nature of the process. Since this is so, they can usually 
be identified, and, hence, remedial action may be taken to remove 
them. Thus, the golfer might not be following through on his swing 
regularly; he might be shifting his grip; his stance might vary from 
swing to swing. All of these factors would tend to introduce varia¬ 
bility in the golfer’s game. These factors, however, could all be de¬ 
tected by a professional golfer, and attempts could be made to cor¬ 
rect them. . 

But, even if every swing were in keeping with sound practice, the 

scores would still vary from round to round. They would vary be¬ 
cause of the common causes influencing the golf-playing process. 
Common causes may be subdivided into two groups: variable and 
constant. The variable causes usually consist of a host of factors 
whose influences are unidentifiable. No doubt there are very many 
such factors, and the effect of any one of them must be very slight. 
In fact, the influence is so slight that if one factor were removed, its 
absence would not be felt. If an additional factor were added, its 
presence similarly could not be detected. The aggregate effect of 
these variable common factors, however, is to introduce a pattern of 
variability into a process. 

The constant common causes have a uniform impact on each unit 
(round of golf). Thus, an injury during childhood might have re¬ 
sulted in the favoring of certain muscles by the golfer when he 
swings This might weaken the power of each drive and might result 
in a constant effect of adding 2 strokes to each round played.. Such 
constant factors, since they affect each round uniformly, obviously 
do not influence the variability of the scores. They do affect the 
average or level of the scores. Thus, if the cause of the constant ef¬ 
fect of 2 strokes per round were discovered and corrected, the aver¬ 
age score would drop from 72 to 70. But, the variability about the 
average would remain the same. These constant common causes are 
inherent in a given process and are generally difficult to identify, 
especially if the process was efficiently designed. 

What is the relationship between the cause systems making for 
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process variability and the decision-making process? Obviously, if 
the variability of a process is such that its behavior can be attributed 
to common causes, it is best to leave the process alone. The process 
is simply operating within the limits of its inherent capability. To 
search for special causes under such circumstances would be costly 
and wasteful, and might even result in injecting trouble into a proc¬ 
ess in which no trouble exists. However, if the behavior of a process 
is such that the presence of special causes is revealed, then action to 
look for trouble should be taken. 

Since the decision on the process is to be based on sample observa¬ 
tions, risks of incorrect action exist. The nature of the possible ac¬ 
tions and risks involved in decisions on process variability are as fol¬ 
lows : 


Action 

State of the World 

No Trouble Exists 
(Common Causes 
Affect Process) 

Trouble Exists 
(Special Causes 
Affect Process) 

Leave process alone. 

Investigate process. 

Correct 

Incorrect 

Incorrect 

Correct 



The control chart shown in Figure 11.1 presents graphically a de¬ 
cision rule for determining the appropriate action on the process. 
The chart is interpreted as follows: If sample means fall between 
the lower ( LCL ) and upper (UCL) control limits, leave the process 
alone; if any sample mean falls on the control lines or outside of the 
control lines, investigate the process. Since the golfer’s average 
scores fell within the control limits, it was decided that no action 
need be taken. In other words, it was concluded that the variability 
in his game was due to common causes—i.e., his game was in statis¬ 
tical control . The variability in the scores was attributable to the in¬ 
herent nature of his playing—and nothing had “gone wrong” with 
his game. 


11.3 The Control Chart for the Proportion Defective 

We shall now direct our attention to the construction of the most 
common types of control charts. In this section we shall consider one 
of them—the control chart for the proportion defective (called a 
p-chart). The sample proportion of defectives is used as a basis for 


decision on a process when the quality characteristic under consid¬ 
eration is described as an attribute—usually as defective or nonde¬ 
fective. Such situations occur, for example, when radios are checked 
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to see whether or not a wire is soldered to the proper binding post; 
when books are examined to see whether or not any pages are miss¬ 
ing; and when attendance records are checked to see whether or not 
an employee is absent. 

To facilitate the succeeding discussion, let us consider the follow¬ 
ing illustration. 

Example 11.5 A manufacturer of automatic dryers decides to 
use a control chart to check on the process of shaft and washer assem¬ 
bly-one stage in the manufacture of his product. Each sampled as¬ 
sembly is classified defective or nondefective. Hence, a p-chart is 

called for. ...... ,, , 

Table 11.1 presents the proportion or fraction defective shaft ana 
washer assemblies during the month of April in samples of 1,000. (It 

Table 11.1 

INSPECTION RESULTS FOR SHAFT AND 
WASHER ASSEMBLIES 
April, 1958 

(1,000 Assemblies Inspected Each Day) 

Date Number Defective Proportion Defective 


1 . 

28 

.028 

2. 

42 

.042 

3. 

36 

.036 

4 . 

40 

.040 

5. 

35 

.035 

6. 

37 

.037 

8. 

33 

.033 

9. 

41 

.041 

10. 

40 

.040 

11. 

44 

.044 

12. 

39 

.039 

13. 

42 

.042 

15. 

45 

.045 

16. 

28 

.028 

17. 

29 

.029 

18. 

25 

.025 

19. 

31 

.031 

20. 

43 

.043 

22. 

60 

.060 

23. 

37 

.037 

24. 

76 

.076 

25. 

31 

.031 

26. 

30 

.030 

29. 

61 

.061 

30. 

47 

.047 































Ch. 11] 


THE CONTROL CHART 


339 


is customary to use the term “fraction defective” rather than “nro- 
portion defective” m discussions of process control) 

Note that even if the 1,000 assemblies examined on any day were 
to constitute the entire day’s output, they would still constitute a 
sample. Process control involves analytical studies, and a dayl out- 

conside 0 ration. SamP “ ® mfinit6 population of assemblies under 

defI?Hve en Th al Hne ° f a r, Chart iS S6t at '' the process Proportion 
defective. The upper and lower control limits are placed at a dis¬ 
tance of 3 standard errors of a proportion above and below the cen¬ 
tral line respectively. These limits are, therefore, referred to as 
8-signw limits. If a sample proportion falls within the 3-sigma lim¬ 
its, it is indicative of the result of common causes and no action is 
taken on the process. If a sample proportion falls on or beyond the 
3-sigma limits, action is taken to look for special causes. Since this 
decision is based on sample observations, risks of incorrect action 
exist-nsks that we may measure, and that we shall discuss below. 

o determine the control limits it is necessary to know the popu¬ 
lation proportion. Although the process proportion defective is not 
known exactly, it can be estimated on the basis of the 25 000 obser¬ 
vations gathered in the 25 samples. In setting up control charts it 
has been found, from experience, that 25 samples, as a rule are 
needed to obtain a sufficiently precise estimate of the decision-pa 
rameter However, if it is too costly to wait until sufficient data have 

been gathered, fewer samples may be used to construct the initial 
control chart. 

We know that an unbiased estimator of the population propor- 
tion is 1 


7T = p — Number of D efectives 

Total Number of Items Tested . 
If we apply this to the data in Table 11.1 we find 


1,000 
25,000 
- .040 . 


We can now compute an estimate of the standard error of the 
proportion for samples of size 1,000 from the estimate of ?r: 


& p 


4 

4 


7r(l ~ 7r) 

n 

C040)(,960) 

1,000 


.0062 . 
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The upper and lower control 

LCL = It - 3a„ 

= .040 - 3(.0062) 
= .021 


limits can now be calculated. 

UCL = t + 3£ P 

= .040 + 3(.0062) 

= .069 


Figure 11.2 represents the control chart—the graphic representa¬ 
tion of the decision rule. The possible values of the sample propor¬ 
tion defective are indicated on the vertical axis, and the samples are 
identified on the horizontal axis by the date on which they were se¬ 
lected Horizontal lines are drawn to represent the process average 
and the upper and lower control limits. The observed sample propor¬ 
tions are plotted above the appropriate dates. 


Figure 11.2 

CONTROL CHART FOR PROPORTION DEFECTIVE FOR 
cuact Akin WA^HFR ASSEMBLIES 



We now must check to see if all our sample proportions fall be¬ 
tween the upper and lower control limits. If they do, then our con¬ 
clusion is that the process variability is due to unassignable causes 
and we do not look for any trouble. In our illustration, however, t e 
samples taken on April 22, 24, and 29 indicate the presence of as¬ 
signable causes. The control chart, therefore, indicates that the proc¬ 
ess is not in statistical control. Before looking for trouble, however, 
it is desirable to check the inspection reports to make sure that no 
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clerical errors were made in reporting the number of defective as¬ 
semblies. 

We have, thus, observed one use of the control chart, i.e., to deter¬ 
mine whether or not a process is in statistical control. The process in 
our illustration is not in control. If a process is not in control, it is 
not possible to predict the pattern of its future behavior, except to 
say that it is likely to stay out of control. The control chart will be 
descriptive of the behavior of sample proportions only if the proc¬ 
ess is acted upon by common causes. Therefore, in order to use the 
control chart for maintaining control it is first essential to secure 
statistical control for the process. 

A check must be made to determine the nature of the special 
causes that seem to be affecting the process and to correct them, if 
possible. It is also necessary to drop the observations for the days 
when the process was out of control and re-estimate the process 
proportion defective. We are interested in the proportion for a proc¬ 
ess in control. Data based on observations when the process was not 
in control would distort the results. 

Example 11.6 The observations for April 22, 24, and 29 have been 
excluded, and the process proportion defective has been re-estimated 
below: 


7T = 


SU3 


22,000 

= .037 . 


The revised standard error of the proportion now is 

*, = JhfZh 

\ n 


/(.037) (.963) 
\ 1,000 
= .0060 . 

The revised control limits are 


LCL = T - 3o- p UCL = fr + 3& p 

= .037 - 3(.0060) = .037 + 3(.0060) 

= .019 = .055 . 

In Figure 11.3 the revised control chart is drawn and the sample 
proportions are plotted. You will observe that now all sample pro¬ 
portions fall between the control limits, and we, therefore, con¬ 
clude that the process was, for those days, in control. 

Note again that it is essential for a process to be in control before 
we can use the control chart as a guide for future action on the proc- 
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ess. If the process is out of control, the causes for the excessive 
variability must be sought out, must be corrected, and the process 
must be brought into control. Statistical theory enables us to predict 
the future behavior of a process only if it is a random process. It tells 
us that the same percentage of results will occur within given limits 
as long as the chance system prevails. 

How do we use a control chart, once the limits have been set? 

Example 11.7 Let us assume that during May, 1958, 1,000 as¬ 
semblies were inspected daily with the following results: 

Fraction 

Date Defective 

.S 

.034 

..042 

..016 

You will observe in Figure 11.3 that for the first 4 days the observed 
proportion defective fell within control limits. This indicates that the 
process seems to be in statistical control and the action is to leave the 
process alone. On May 7th, however, the proportion defective is be¬ 
low the LCL . This indicates the possible presence of an assignable 
cause. The action is to look for the cause of the trouble. 

In general, therefore, our rule for the continuing decision problem 

Figure 77.3 

CONTROL CHART SHOWING REVISED LIMITS FOR PROPORTION 
DEFECTIVE SHAFT AND WASHER ASSEMBLIES 
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is: If the observed proportion defective falls between the control lim- 
i s, eave the process alone. If the observed proportion defective falls 
on or outside either limit, look for assignable causes. 7 

. The q uesti on as to why we set lower control limits for the propor¬ 
tion defective may have occurred to you. Why do we look for 
trouble, as m the above illustration, when our process behaves as if 

1 -^ W +rf Slgl \ lfic f, ntl y better? In the first place, when a point is out- 
si e e con ro limits, it indicates that assignable causes are operat- 
mg on the process. The pattern of variability is no longer random. 

pon investigation it might be discovered that the unassignable 
cause that made for a significant decrease in the proportion defec- 
ive might at a later time result in a sharp increase in the propor¬ 
tion defective. In the second place, if the assignable cause is found 
to be a factor that will make for improved production, then the proc¬ 
ess may be modified to incorporate that factor. In either event the 
process must be brought under statistical control before we can con¬ 
tinue to use the control chart as a basis for action 
The preceding discussion has stressed industrial applications of 

ofthet^h t 6 are ’ however > man -V administrative applications 
of the p-chart. Two illustrations follow: 

Example 11.8 Alden’s Inc., is a mail-order house that has been 
app ymg control chart techniques to its operations for many years 
One such operation involves the process of filling orders in one of the 
merchandise departments. Every item that is ordered by a customer 
is selected from stock on the basis of its catalogue number color size 
and quantity. After the items are selected, they are checked against 
the order and placed on a conveyer belt to a gravity chute Several 
times during the day 100 work units are selected at random from the 
® hat , ea “ d ms P ected - A work unit is considered to be incorrectly proc¬ 
essed if any of a. set of specified error possibilities including catafogue 
umber, color, size, quantity, and price are in error. The proportion 
of incorrect units is posted on a control chart as quickly as possible 
When a point is out of control, remedial action is taken immediately.* 

_ Example 11.9 The control chart technique is used by United Air 
Lines to control the accuracy of recording plane reservations 

AbomTnm^Il^K LmeS PkneS ' S controlled at th e Denver office. 
About 10,000 telephone messages are received daily. The incoming 

telephone messages are recorded manually on message slips The tele! 

phone wires are tapped 3 times every day, and 200 consecutive mes- 

agffinsTIhe 6 ! 0 H' C 2?° n 1 ? 0py ° f the messa S e ^ then checked 
_against the recording. The slip may be in error with respect to flight 
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number, date, stations involved, number of seats, or the recording of 
information that did not enter into the telephone conversation II one 
or more of the possible errors occur, the slip is counted as defective. 
The control charts employ 3-sigma limits, and the process average is 
0.5 incorrect messages per 100 transcribed. 3 


11.4 Control Charts for the Mean and the Range 

When a quality characteristic such as the diameter of a cylinder, 
the tensile strength of a piece of steel, or the resistance of a coil 
can be measured, the mean may be used as a basis for action on a 
process. The mean alone, however, is not sufficient to indicate the 
quality of such processes. 

Example 11.10 Two samples each consisting of 4 cyclinders are 
drawn, and the inner diameter (in inches) of each of the 8 cylinders 
is measured with the following results: 


Sample 1 

Sample 2 

1.52 

0.50 

1.49 

1.10 

1.48 

2.30 

1.51 

2.10 

xi = 1.50 

z 2 = 1.50 


Thus both samples yield means of 1.50 inches, and both might in¬ 
dicate that the process level is in control. However, the variability 
for sample 2 is much greater than for sample 1. The cylinders m sam¬ 
ple 2 are all either too small or too large, even though their average 
inner diameter indicates control. 


It is, therefore, necessary to use a control chart for the mean 
(x-chart) in conjunction with a control chart for process variability. 
The criteria for variability that are used are the range (#-chart) or 

the standard deviation (s-chart). 

The two charts, one for the level and the other for variability, 
used in conjunction, supplement each other in searching for assign¬ 
able causes. Changes in process level usually require different correc¬ 
tive actions from changes in process variability. Shifts in process 
means reflect changes in all pieces made. Such factors as a gradual 
increase in temperature, tool wear, different methods used by work¬ 
men on different shifts, or a new batch of material of greater hard¬ 
ness might result in significant shifts in the process mean. Signifi¬ 
cant changes in process variability, on the other hand, might be 
traced to such factors as worn bearings, variability in pieces received 

‘ rj“s. Brinkman, “United Air Lines Speeds Reservations,” Paper-Work Simpli¬ 
fications, No. 16, 1949, pp. 9-10. 
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from a previous operation, a loose part, or a lack of concentration 
on the part of an operator. In general, the factors that affect all 
i ems result m shifts m the process level, while the factors that af- 
ect process variability are operative intermittently and, hence af¬ 
fect some items and not others. ’ 

Let us now consider the construction of an Dehart for the 
described in the illustration that follows. the process 

Example 11.11 A cannery fills No. 303 cans with corn bv a 
machine process. It wants to set up a control chart so th»+ 
drained weight in the cans is at a given average levd BeW T 
control chart can be used as a guide for future production control we 
must determine whether the process is operating under statistical 

Table 7 7 .2 

DRAINED WEIGHT OF CONTENTS OF SIZE NO 303 CANS 
OF GOLDEN SWEET WHOLE KERNEL CORN 

_________ (Weight in Ounces) 


Sample 

No. 


Date Hour 


Weight in Each of 5 Cans 
of a Sample 
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. . Jo the drained weights of corn vary because of un- 

control That is do the 5 \ ime per iods during the day a 

iz, « t « = ^ " d tr 

S'„“ S «h= initial »n.,ol chart. The <*-»«<»* »" 

recorded in the Table 11.2. 

The central line in the x-chart is set at an estimate of ^the 
nr JceL average. The upper and lower control limits are the custom¬ 
ary 3-sigma limits—in this instance 3*» since the sample mean is 

a**. ^ * ~ ry ? est r te b the 

proeess^verage and the process standard deviation from the obser¬ 
vations obtained in the 25 samples. The estimate of the process aver¬ 
se itL average of all the sample observations. Since al samples 
are 5 te ^Ime size, we may simply average the 25 sample mean. 


A = 


in 


where m is the number of samples. Hence, 

425 
p “ 25 
= 17.0 ounces . 

The standard deviation of the population is usually estimated 
from the mean of the 25 sample ranges. In Example 7.9 we saw t a 
an estimate of the population standard deviation is given y 

0 ~ di ’ 

where R is the average of sample ranges and d 2 a constant which 
depends on sample size. (Values of d 2 for selected sample srzes ap- 
pear in Table 11.3.) 

Table 7 7.3 

SELECTED FACTORS USED IN CONTROL CHART 


Sample Size 

ck 

A 2 


Di 

o . 

1.128 

1.880 

0 

3.267 

9 575 


1.693 

1.023 

0 

A 

2.282 

9 115 


2.059 

0.729 

0 

K . 

2.326 

0.577 

0 

£.1X0 

9 004 

A . 

2.534 

0.483 

0 

& .UVJt: 

1 924 

7 . 

2.704 

... 

0.419 

0.076 
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The estimate of the standard deviation for the canning process is 

„ = 2.82 
° 026 
= 1.21 ounces . 

size is small, the loss in precision of iho Jt !’ , ? th Sample 

fro ” ^~^r hti0 ” 

e upper and lower control hmits for the z-chart are: 

B ut LCL = fi- Z&i UCL-H + 3*. 


&* = 


Hence, 


& 

\ / n 

= L?1 

= .541 . 


LCL 


17.0 - 3(.541) 
15.4 ounces 


UCL 


17.0 + 3(.541) 

18.6 ounces. 

lowing relationship: P P ’ These table « are based on the fol- 


But, 


Therefore, 


3<te = ~ 
vn 




3 &x — 


SR_ 

do^/n 


i te samples <* a given size. I 

4,= * 

d 2 Vn 

to C " PnM Of » and 

may. therefore* b^m'itteii^’ ^ forml,,as for the centre! limits 
LCL = jx~ Aji 


UCL — £ + AJi . 
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If we apply these formulas to the canning process data we obtain, 


LCL = 17.0 - (0.577)(2.82) 
= 15.4 ounces 


UCL = 17.0 + (0.577) (2.82) 
= 18.6 ounces. 


The results, as expected, are the same as those computed without the 
use of the A 2 values. 

The process average line and the 

lines are drawn on t e con ro p t ] ie sample means 

the 25 sample means are also plotted. Sinceall me 

»ow”o te control eh„. of the range. The ^ hne « 

the chart is equal to R. As, heretofore, we shall set the contro r 
at a distance of 3<r B from R. Thus 
LCL = R - 3<tr and 
It can be shown that 

(TR = <Tw X O' > 

.here ... is the stend.rd devmtio, oi: the rehrUve 

write 


UCL = R + 3ff« . 


&R — <Tw 


R 

d% 


3<Ttc^ 


We may, therefore, write the control limits as 

LCL-S-T^ TOJ.-S + -J, 

given sample size. Thus, we define 

(t-^cnd !>.-(> + %=)- 

Since, the range can never be lee, then aero, D, i« given . ™lue of 
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zero whenever the above expression for D 3 yields a negative value. 

The control limits for the range may now be written as 

LCL = DM and UCL - Djl . 

Values of D s and Z) 4 for selected sample sizes are shown in Table 
11.3 (p. 346). The control limits of the i^-chart for the canning proc¬ 
ess illustration are 

LCL = (0)(2.82) UCL = (2.11)(2.82) 

~ b = 6.0 ounces . 

The control limit lines are shown in Figure 11.4, as are the values 
of the 25 sample ranges. The chart indicates that the process is in 
statistical control, with respect to variability. Since both the f-chart 
and -chart indicate statistical control, they may be used as guides 
for taking future action on the process. Note carefully that both 
charts must indicate control. If either chart or both charts indicate 
a lack of control, the process must be investigated and brought into 
control. 


Figure 11.4 

CONTROL CHARTS FOR MEAN AND RANGE FOR DRAINED WEIGHT OF 
CONTENTS OF SIZE NO. 303 CANS OF GOLDEN SWEFT KERNEL CORN 


19 


o 

i 18 


17 


£= 16 


15 


x- CHART 
UCL 


?= 17.0 


LCL 


■L-jL- L-l l I * ■ ^1—I—I—I—I—I—t~J—I—1_L 

5 10 


15 
SAMPLE 


20 


25 


~ 8 
CO 

tu 

2 

8 6 

UJ 

2 4 
< 
ac. 

UJ 

•—I «, 
Q. 2 


R-CHART 
UCL 


• • 


■■m v. 


£- 2.8 


[ A QL i —LJ—t— 1 _l.. I » . 

5 10 15 20 25 


SAMPLE 
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The decision on whether a p-chart or a combination of x- and 
E-charts is appropriate is not always clear cut. Thus, a company 
checks on a process of cutting steel rods of a given length. It may 
check the quality of performance by means of go and not-go gages. 
That is, if the rod is too short, it will go through one gage. On the 
other hand, if the rod is too long, it will not go through a second 
gage. Every rod that is tested by the gages is classified as defective 
or nondefective, and the p-chart is appropriate. 

The rod, however, could be checked by actual measurement of its 
length—for example, to the nearest tenth of an inch. In such in¬ 
stances a combination of the &*-chart and E-chart is appropriate. 

What considerations determine the choice of the type of control 
chart in such cases? For one thing, the p-chart is not as sensitive a 
measure for detecting shifts in the nature of a process as the combi¬ 
nation of x- and E-charts. Samples of 4 or 5 are usually sufficient 
for mean and range charts to detect shifts in process level or process 
variability. A much larger sample is needed for a p-chart. 

Example 11.13 The length of a 10-inch rod is checked by go 
and not-go gages. If the process proportion defective is 1 per cent, 
a sample of 4 would fail to detect a shift to 10 per cent defec¬ 
tive approximately two thirds of the time. That is, about two thirds 
of the time samples of 4 would have no defective rods even though 
the process was turning out rods 10 per cent of which were defective 
If a-, the proportion defective, were equal to .10, the probability or 
selecting a nondefective rod would be .90. Hence, the probability of 
selecting 4 nondefective rods in a sample of 4 would be 

.90 X .90 X .90 X .90 = .6561 . 

Thus, about two thirds of the time samples of 4 would indicate that 
there is nothing wrong with the process even though as many as 10 
per cent of the rods produced were defective. 

While x- and -charts are more sensitive, they usually are more 
costly to construct than 71 -charts. More skilled and better trained 
inspectors are needed to make measurements than are needed jio use 
go and not-go gages. The p-chart may be used alone; the x- and 
J?-charts must be used in combination. Furthermore, if more than 
one quality characteristic must be measured, one p-chart might 
suffice while separate x- and E-charts would be required for each 
characteristic. Thus, if we were required to control the length of the 
rod, the diameter of the rod, and its breaking strength, a single 
p-chart might be used. If either the length, diameter, or breaking 
strength were not proper, the rod would be called defective. If 
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*' and f-^arts were used, three pairs of charts would be needed to 
CO h- n° u he ength ’ dlameter > and breaking strength. The decision on 

ptd 0 : s tr in sit r ions where a chdce ?s 

pe id on the balancing of costs as against possible savings 

The s- and ^-charts have many uses in addition to the control of 
production processes. Two such illustrations follow. f 

wofk'hS it fi ™ d6SireS t0 COntro1 the amount overtime 

purpose. US6S 0 combination o f *- and E-charts for this 

single^mnM STT ^ of bou /! “f overtime worked constitutes a 

n . Mon ■ Tues - Wed. Thurs. Fri 

Overtime hours: 40 94 120 80 52 

The^ample mean (.7) is 77.2 hours, and the sample range (R) is 80 

With these basic data, control charts are constructed 1 

manner with 3-sipia limits. Whenever C! 

of overtime, or the range of hours, or both falls^utsWe of contro 

indert a L S „ Pe Af T ° f ^ ° Vertime ^at week is 

unoertaKen. At times out-of-control points may be exDlained 

rush orders or a breakdown of machinery during regular Wi W 

c° UrS \ otber times it might be found that excessive overtime was 
caused by poor production scheduling. overtime was 

Example 11.15 A cost accountant employs x- and E-charts to 

p."., h.r;r k w “' r*“ in * n ” hi ” »!>»*“"» “fc 

for thcfn l machines. Every day the average weight of scran 

Thete fo!m The^asi? dT d IJ"** in Wdghts (E) is determined, 
xnese lorm the basic data for the control charts. 

The charts are prepared and used in the usual manner. 

11.5 The Control Chart for Number of Defects 

There are occasions when neither the combination of the s- and 
process! ^ ^ “ bases for dedsions on a 

Example 11.16 A manufacturer of television cabinets desires to 
control his production process. To do so he counts the number of de 
fects. Defects include scratches, nicks, dents, unpainted portions noor 
joints and so on. The defects can be counted, but they cannot beex- 

totaTn.fY pr °]? 0rtlon because there is no method of determining the 
total number of opportunities for defects. S 

cidfnTmte e Th; J L A C01 T7 d f ir6S t0 “ aintain contro1 of its ar¬ 
dent rate. The attainment of safety may be looked upon as a ptoc- 
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An accident is a defect in the process. Again, we can count the 
number of accidents, but we do not know the denominator that is 
needed to express the number of accidents as a proportion. 

The basis for decision on a process in situations such as those 
described in the last two examples is the control chart for the num¬ 
ber of defects—usually called the c-chart. The central line on the 
ft“»ver, S e number of Meets, C. Thue, if » control oho,, 
were to be used to check the process of assembling an airplane wing 
C would represent the average number of defects per wing or a 
wings that might be assembled by the same process g 

limits are set at ±3*. from C. The question now is. How do we 

estimate C and <r c ?” 

We may say that 

C = nn . 

That is C is equal to the probability that a defect will occur multi¬ 
plied by the number of opportunities for such a defect to■occur.. n 
the case of the airplane assembly, we may assume that n, the op¬ 
portunity for a defect to occur, is large—a wing does encompass a 
wide area We can further state that w is very small-otherwise the 
assembly process would not result in useable wing assembles. We 
may make Buch assumptions even though we do not know the exact 
value of either n or We can, however, estimate their product- 

'^Fo^any given wing assembly we can say that the number of 
defects (c), 

c — np . 

The expected value of np is n». Hence, the expected value of c is 
equal to C. Furthermore, the standard deviation of np is 

a„ p = vW(l - ir) 

= VW — nir 2 . 

But, when - is small, nS is very small-very close to zero-and 
may be disregarded. Hence, we may write 


Tup 


= Vmr or VC 


We may conclude, therefore, that the sampling distribution, of 
c is approximated by a distribution for which E(c) is equal to C 
in addition for which, v c is y/C. Since the mean and the variance are 
equal, it is necessary only to estimate C to construct a contro c ar 
for the number of defects. 
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Example 11.18 The number of accidents in a plant varies from 

wnrV H° We ® k ' number of workers employed and the hours 
w rked remain fairly constant from one week to the next Thus the 

+ t CCldC f ? mains con stant. The firm wants to use a con- 
1 chait—the c-chart—to determine when and when not to take ac- 

tion on its safety program. 

Since it would be impractical to wait 25 weeks to obtain 25 points 

initifdlv tn U + y r T f S , et t P 8 C ° ntrol chart > 13 P° in ts are used 
initiaUy to set up the control chart. The number of accidents is de- 

reported as CaS6S ' F ° r ^ ^ 13 W6eks ’ these are 


Week Ending 

Jan. 9. . . . 

Number of 
Accidents 

1 “3 


16. 

19 


23. 

9 


30. 

15 

Feb. 

6. 

11 


13. 



20. 

17 


27. 

• • • 15 

Mar. 

6. 

17 


13. 

1 ^ 


20. 



27. 

Id 

Apr. 

3. 

12 

Total. 

.182 


The average number of accidents per week is estimated from the 
sample data in the preceding example as follows: 



Total Number of Accidents 
Number of Weeks 

182 

13 

14.0 


The upper and lower control limits 
lows: 


may now be determined as fol- 


LCL = C-3Z c VCL = 6 + 3? 0 

= C - 3Vq = C + 3Vq 

~ 14 ~ 3V] 4 = 14 + 3VI4 

= 14 - 11.2 = 14 + 11.2 

= 2 - 8 . = 25.2 . 

The c-chart for the number of accidents is shown in Figure 11 5 
You will observe that all points fall within the control limits. In 
other words, the number of accidents per week vary because of com- 
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mon causes, and the safety program of the firm requires no special 
investigation. Furthermore, the firm may project the control chart 
into the future to check on its safety program. 


Figure 7 7.5 



Let us now suppose that the following data are compiled for the 
ensuing 10 weeks. 


Week Ending 
Apr. 10. 


Number of 
Accidents 


17, 

24 


1 

10 

.27 


May 1 
8 

15 

22 

29 


12 
10 
8 
16 
, 7 


June 5. ^ 

12. 4 

For the week ending April 10, the control chart in Figure 11.5 in¬ 
dicates that a point was below the lower control limit. The ma er 
was investigated, and it was found that minor accidents were not 
reported by a newly hired plant physician. The physician was, there- 
fore, instructed as to proper accident reporting procedures. 

For the week ending April 27, a point was above the upper control 
limit. This time an investigation revealed that workers were neglect¬ 
ing to wear safety glasses. The matter was corrected. 


11.6 The Risks of Incorrect Action 

The control chart, as we have seen, represents a statistical de¬ 
cision rule for determining whether or not to take action on a proc- 
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riskfoTt f C ° ntr01 ° hart makeS USe of sam P le data , the following 
risks of taking incorrect action invariably exist: g 

a) SVSk ° f taking a ° ti0n 011 the pr0cess if no trouble exists- 

5) JxMetS takinS aCti ° n 0n the Pr ° cess whe * t™ble does 

A decision rule may be determined by setting the a-risk and /J-risk 
hat we can tolerate relative to two possible states of the world Ind 

^rolVmtTfT * 6 r Uired ^ ^ and Critical valuedcS 

trol limits—of the pertinent statistic. An alternate method of deter- 
mmg a decision rule is to set the sample size and the a-risk and 
hen determine the critical values. It is this latter course that is fol 
lowed m the construction of control charts. 

In practice, as you have seen, 3-sigma control limits are usually 

of d T, Pr ® fe . renc ^ t0 the dlreet specification of the a-risk in terms 
of Probabilities. The use of 3-sigma limits rests, to a large extot 
on the fact that the sampling distribution of the statistics used ^re 
° n 7 approximated by known sampling distributions. Thus the 
sampling distribution of the mean is distributed normally only if 
the population from which samples are drawn is normal, addi- 
| 01 , ! . t0 “ easure Probabilities precisely, the mean of the population 

Tconto Thtt th eVlatl ° n “ USt ^ kn ? Wn - H ° WeVer ’ “ cons tructing 
he Pr ,° CeSS mean and standard deviation are esti- 

c“ntrol lite e at e i 3 J f Ct t0 , Sampling error ‘ Thus > “ setting the 
ontrol limits at -3** from /-*, we cannot say that the probability 

oSrTthe 6 “ 6anS Wil1 M ° UtSide ° f con tr°l limits is precisely 
f 0 l 7 mr! h f° CeSS average 1S We can > however, say that the 
probability of a sample mean falling outside of control limits is very 

smafi i/ the process average is The 3-sigma limits set a low prob- 
y _ or e “-risk . its exact value is unknown in most cases. 

imi &r .. eon f. lderatlons a PPly to the other control charts. Thus 
the sampling distribution of p is only approximated by the normal 
curve even for large samples. Similarly, the normal curve offers only 

ZSTZTT th ! Sampling distribution of the number of 
defects. Also, the values for D, and D-, used for the fl-chart assume 
that the population is normal. 

It is, therefore, argued that precise probability limits are not 
needed, since the probabilities are at best approximations. Further- 
more, the use of 3-sigma limits as a standard for all control charts 
makes the technique simpler to use with plant personnel who are 
untrained in statistics. Finally, about 20 years of experience have 
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shown the practical value of the 3-sigma control limits Their use 
has made for an economic balancing of the risks of possible incorrect 

de After the control chart (decision rule) has been determined it is 
possible to calculate the probability of the second-type error leav¬ 
ing the process alone when special causes are present—for all pos¬ 
sible values of the decision-parameter. In other words, we can deter¬ 
mine the OC curve for the control chart. 

Example 11.19 Figure 11.6 represents the OC curves for the 

x-chart in the process of filling cans of corn i^lS 6 ounles 
estimated process average is 17.0 ounces. The UCL is • » 

and the LCL is 15.4 ounces. The estimated process standard devia- 

tion is 1.21 ounces. 

Figure 77.6 

OPERATING CHARACTERISTIC CURVE FOR CHARTFOR 

SAMPLE MEAN DRAINED WEIGHT OF SIZE NO. 303 CANS 

/m? r\ek.l CU/CCT KTPNFI CORN 



The solid line represents the OC curve for a single sample. That is, 
it shows the probability that a single sample drawn from the process 
will result in leaving the process alone. The curve indicates that 
the process average net weight of cans were to shift 1 ounce to 16^0 
ounces (or 18.0 ounces), the probability that the process would be 
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left alone—the shift would be undetected—is .87. The probability 
ot a shift of 2 ounces remaining undetected by a single sample would 
be only .23. The probability of not detecting a shift of 3 ounces or 
more is practically zero. In other words, a shift of 3 ounces or more 
would be detected practically always with a single sample. In gen¬ 
eral, therefore, the greater the shift of the process average from its 

desired level, the more likely that this shift will be detected by a 
single sample. 

The control chart, however, relies on periodic sampling from a 
process. Successive samples facilitate the detection of sustained 
shifts m the process average. Thus, the probability of a shift of 1 
ounce remaining undetected by a single sample is .87. The prob- 
ability that such a shift remains undetected in 2 successive samples 
is .87 X .87, or .76. In general, the probability that any shift remains 
undetected m m samples may be expressed as (P.)", where P« is the 
pro a llity of a shift of a given amount in the process average re- 

1 ? a ” 11I ! g 1 Undetected by a sin S le sample. Thus, the probability that a 
sfnft of 1 ounce remains undetected after 5 samples is (,S7) S = .50. 
That is, a shift of 1 ounce will remain undetected .50 times in 100 
with 5 successive samples. 

< figure 11 . 6 , OC curves have been drawn to show the probabil- 
ities of leaving the process alone-a shift in the process remaining 
undetected after 2 and after 5 successive samples. The chart indi¬ 
cates that successive samples result in rapid decreases in the proba¬ 
bility that shifts m the process average will not be detected Thus 
a sustained shift of 2 ounces will practically always be detected 
with 5 successive samples. 

Since samples are drawn regularly for control charts, large sus¬ 
tained shifts in the process average are almost certain to be detected 
after a few samples. The probability of detecting small sustained 
shifts in the process average also increases, but more slowly. Thus 
knowledge of the OC curve for a control chart is helpful in determin¬ 
ing the frequency with which to sample. The more important it is 

de 1 t , ec 1 t sh , lfts m the Process average, the more frequently samples 
should be drawn. , ■ 

The OC curve of the 2-chart has been used to illustrate the dis- 
cussion in this section. The same conclusions, however, are valid for 
all the control charts considered in this chapter. 

11.7 The Samples, Their Size, and Frequency of Selection 

Each sample taken from a process represents a subgroup of obser¬ 
vations. The successful operation of control chart techniques de- 
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pends upon the proper grouping of sample observations into sub¬ 
groups, the size of these subgroups, and their frequency of selection. 

Remember that the purpose of a control chart is to detect the 
presence of assignable causes of variability. To serve this purpose 
adequately, samples should consist of subgroups of output that are 
as homogeneous as possible. That is, every observation in a sample 
should reflect the influences of the same set of causes. Thus, when 
assignable causes do appear, they will manifest their existence in dif¬ 
ferences among the samples. If subgroups are chosen so that all sam¬ 
ples reflect assignable and unassignable causes, the presence of the 
assignable causes escapes detection since their effects are sub¬ 
merged. Subgroups, in other words, must be chosen so that the vari¬ 
ability among subgroups exceeds the variability within a subgroup. 

Thus, it would not be desirable to include in a sample the output 
of two or more shifts. In this way an unassignable cause in one shift 
might escape detection on the control chart. Similarly, mixing the 
product of more than one machine or more than one worker might 
conceal the assignable cause present in the output of one machine 
or one worker. Let us use an illustration to explain this point more 
clearly. 

Example 11.20 Two machines are supposed to be cutting tubing 
into 8-inch lengths. The lengths of the tubing turned out by these 
machines, to the nearest inch, are given in Table 11.4. 

Table 11.4 


LENGTH OF TUBING CUT BY MACHINES 

(In Inches) 


Time 

Machine A 

Machine B 

9:00 

3 

8 


5 

8 


2 

8 


2 

8 

10:00 

6 

8 


10 

8 


9 

8 


7 

8 

11:00 

5 

8 


3 

8 


4 

8 


4 

8 

12:00 

10 

8 


10 

8 


8 

8 


8 

8 
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The product of machine B is obviously in control since the product 
(in defiance oi nature) is perfectly uniform. The control chart of the 
mean based on 4 samples of size 4 for the output of machine A is 
shown in Figure 11.7. 


Figure 7 7.7 

CONTROL CHART FOR X FOR OUTPUT OF MACHINE A 
SAMPLE SIZE—4 



The output of machine A is definitely not in statistical control since 
2 of the 4 samples fall outside the control limits. 

Instead of taking samples of 4 from each machine separately, let us 
take 2 items from each machine to make up the sample of 4. Assume 
that the first 2 items listed in Table 11.4 are selected every hour for 
each of the machines. The control chart for the combined samples is 
shown in Figure 11.8. The chart indicates a process in statistical con¬ 
trol. 


Figure 7 7.8 

CONTROL CHART FOR X FOR OUTPUT OF MACHINES A AND B 
SAMPLE SIZE—4 
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In this simplified example, we have shown that by combining m 
one sample a process operated by random forces and one affected by 
assignable causes, the presence of the assignable cause remains unde¬ 
tected. The averaging process conceals the changing cause system 
in A. 

It is, therefore, most important in selecting samples that a ra¬ 
tional choice of subgroups be made. The choice of the proper sub¬ 
group is obviously the engineer’s job. However, the statistician must 
point out its importance for the effective utilization of the control 

chart. . . , . 

For the control charts of the mean and range sample sizes ot 4 
or 5 are used almost universally. One reason follows from our pre¬ 
vious discussion. There is greater likelihood that a smaller sample 
will be more homogeneous. Furthermore, for the same frequency 
of sampling, samples of 4 or 5 are less costly than larger sam- 

ples. 

On the other hand, larger samples give better protection with re¬ 
spect to the risk of leaving the process alone when assignable causes 
are at work. Since the standard error decreases with an increase in 
sample size, the control limits are brought closer together for larger 
samples. This decreases the range of the statistic over which the 
process will be left alone. The smaller this range, the smaller the 
likelihood of leaving the process alone by error. Similarly, the greater 
the frequency with which samples are taken, the greater the proba¬ 
bility the assignable causes will be detected. 

The nature of subgroups, the sample size, and frequency of sam¬ 
ple selection, require careful consideration to ensure effective opera¬ 
tion of control charts. There is as yet no precise solution as to how 
frequently to sample and as to how large samples should be to ob¬ 
tain maximum protection. The answer to this problem requires a 
knowledge of the costs connected with the sample size, and fre¬ 
quency of sampling, as well as the costs of undetected process shifts. 
In addition, knowledge of the probability of such shifts is required. 

11.8 Statistical Control and a Satisfactory Product 

When we say that a process is in statistical control, we mean that 
the pattern of its variability is attributable to common causes. When 
we say that a product is satisfactory, we mean that each unit of the 
product meets the specification set for it. Statistical control is de¬ 
termined through the use of control limits. These, you remember, 
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refer to the limits of chance variability for sample means, sample 
proportions, sample ranges, sample standard deviations, or the num¬ 
ber of defects in a sample. Specification or tolerance limits, how¬ 
ever, refer to the extent of permissable variability in the individual 
items. If a process is operating in statistical control, it does not fol¬ 
low that all the individual items will be satisfactory, i.e., meet spec¬ 
ifications. That is, the variability of the items, although caused by 
chance factors, might be too great to meet the requirements set for 
product. Let us consider an illustration: 

Example 11.21 Let us assume that the specification for size No. 
303 Golden Sweet Whole Kernel Corn is set as 17 ± 1.7 ounces! 
That is, the drained weight of each can should be 17 ounces, with an 
allowance of 10 per cent deviation each way. The lower specification 
is set to protect consumers. The cannery, on the other hand, does not 
want to give the consumer more than he is paying for and, hence, sets 
an upper specification. Any can, therefore, is considered acceptable if 
its drained weight varies from 15.4 to 18.6 ounces. 

The control charts in Figure 11.4 indicated that the process of fill¬ 
ing cans was in control with respect to the process average and the 
process variability. Nevertheless, if you refer back to Table 11.2 you 

will observe that 26 of the 125 sampled cans were not acceptable_12 

cans weighed less than 15.4 ounces, and 14 cans weighed more than 
18.6 ounces. We, thus, see that a process in control can produce items 
that are not satisfactory—that do not meet the specifications. 

Specification or tolerance limits are determined so that the prod¬ 
uct will be serviceable. Thus, for example, if a cylinder is to fit 
snugly over a tube, the specifications for the inner diameter of the 
cylinder must be set so that it covers the tube but does not fit too 
loosely. Some variability must be allowed for in the specifications, 
for, as we know, all production processes are subject to variability! 
What constitutes a serviceable product is a problem not for the stat¬ 
istician but for the engineer. However, the statistician can aid, as we 
shall see, in helping to determine specification limits that are re¬ 
alistic. 

What are the alternatives open to a firm when the process vari¬ 
ability is attributable to common causes (operating in statistical 
control) but when too large a portion of the individual items do not 
fall within the specification limits? An obvious answer would be to 
change the process. Since the process is in statistical control, any 
tampering with it might only make matters worse. A change in proc¬ 
ess, however, is usually a costly undertaking. Such changes might 
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include the purchase of new machinery, the design of new tools, and 
the retraining of workers. 

If the firm does not change the process, then it might, resort to 
100 per cent inspection to separate the good product from the bad. 
This also entails costs and is not possible when a destructive test 
must be used to distinguish between good and bad product. Man¬ 
agement must decide, on the basis of costs, which of the alternatives 

it is to follow. . , T ! i , , ,, , 

At times, a third alternative is available. It has been found that 

producers often set specifications too tightly for the purpose the 
product is to serve. A review of such specifications has disclosed that 
the limits might be relaxed. 

It is, therefore, preferable to determine process capabilities be¬ 
fore specification limits are set. Then, before production begins it is 
possible either to adjust (1) the specifications to the capabilities of 
the process; (2) to consider the possibility of changing the process; 
or (3) to use 100 per cent inspection and to scrap or rework all de¬ 
fective product. 

The control chart can be used as an aid in determining process 
capability. Before a process gets into production a pilot run can be 
made. The pilot run accomplishes two things. In the first place, it 
enables us to determine whether the process is in control. If it is in 
control, it enables us to estimate the process capability—a measure 
of the Variability in the individual items. 

Example 11.22 A manufacturer is to produce a temperature con¬ 
trol device. Forty devices are produced and set so that the switch is 
to operate at 68.0 degrees. The temperature at which the switch ac¬ 
tually operates is recorded for the 40 devices. The devices are grouped 
into 10 groups of 4; £ is found to be 68 degrees, and R is found to be 
0 9 degrees. The control charts for the sample mean and the sample 
range are plotted, and the process is found to be in statistical control. 
The question we wish to answer is: “What is the process capability?” 
You again observe that we could not predict process capability if the 
process were not in statistical control. 

We determine the variability in the individual devices (process 
capability) as follows: With R and the sample size (n = 4) we can 
estimate the process standard deviation (&). 



0.9 

2.059 


= .44 
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If we now assume that the distribution of temperatures at which 
the switch operates is normal, we can determine the process capa¬ 
bility. We know that all but approximately .0027 of the tempera¬ 
tures will fall within the range of - S3- to + S3. 

M - 3<j = 68 - 3(.44) - 66.7° , 

M -f S& = 68 - 3(.44) - 69.3° . 

The manufacturer wanted to set his specifications at 68° ± 0.5°. 
Do you think that the process is capable of meeting these specifica¬ 
tions ? 

You should observe that these computations do not tell you what 
the specification limits should be. They do tell you what type of spec¬ 
ification limits a process could meet. 

11.9 You Should Now Know That 

Variability is a characteristic of all business processes. 

The causes of process variability may be divided into two broad 
categories: (a) common (unassignable, uncontrollable) causes, and 
(b) special (assignable, controllable) causes. 

Common causes are inherent in the nature of a process and are 
usually difficult to isolate and to correct. 

A process whose variability can be accounted for by common 
causes is said to be in statistical control. 

Special causes usually are identifiable and can be corrected. 
A process whose variability can be accounted for by special causes 
is said to be out of statistical control. 

A control chart is a statistical decision rule presented in the form 
of a graph to distinguish between common and special causes. In 
this way, it serves as a basis for deciding whether or not to look for 
special causes. 

A control chart may be designed only for processes that have 
been brought under statistical control. 

The most commonly used control charts are: the p-chart, the 
.T-chart, the 72-chart, and the c-chart. The x- and 72-charts are 
usually used in conjunction with each other. 

Since control charts are based on sample information, two types 
of incorrect action are possible: (a) taking action on the process 
when no trouble exists, and ( b ) neglecting to take action on a proc¬ 
ess when trouble does exist. 

The probability of taking action on a process when no trouble 
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exists (a-risk) often is set implicitly by the use of 3-sigma limits for 
all types of control charts. 

The operating characteristic curve of a control chart presents the 
probability of not looking for special causes for various shifts in the 
nature of the process. 

The samples drawn for use with control charts should represent 
homogeneous subgroups of the population. The sample size and the 
frequency with which samples are drawn are also important consid¬ 
erations in assuring efficient use of control charts. 

Statistical control does not imply a satisfactory product. If a 
process in control does not result in a satisfactory product, a change 
in the process or 100 per cent inspection generally is called for. At 
times, however, specifications might be revised. 

The control chart, by determining process capability, may be used 
to predict the ability of a process to produce a product to meet given 
specifications. However, the control chart does not determine speci¬ 
fications. 

11.10 You Should Now Be Able to Solve 

1. What type of control chart(s) would you recommend in each 
of the following situations? Explain the reasons for your choice. 

a) A company wants to control the error rate in its filing system. 

b ) A company wants to control the number of imperfections in a bolt 
of cloth. 

c) A company wants to control the breaking strength of glass con¬ 
tainers. 

2. A firm manufactures bolts. It decides to use a control chart to 
control the diameters of various size bolts. 

a) Could the firm use x- and E-charts? Explain. 

b) Could the firm use a p-chart? Explain. 

c) What factors would the firm have to consider in deciding between 
a combination of x - and E-charts and a p-chart? 

3. A company decides to use a p-chart to control its process of 
invoice preparation. Random samples of 100 invoices are to be 
drawn twice daily, and the proportion of incorrect invoices in each 
sample is to be recorded. An invoice is to be considered as incorrect 
if it contains an omission, incorrect name of account, incorrect ac¬ 
count number, erasure, strike-over, transposition of figures, incor¬ 
rect quantity, incorrect unit price, incorrect extension, or incorrect 
total. To construct the initial control chart 25 samples are drawn, 
and the results are found to be as follows: 
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Date 

Sample Number 

Sample Size 

Number of 
Incorrect Invoices 

March 2 

1 

100 

3 


2 

100 

5 

3 

1 

100 

2 


2 

100 

3 

4 

1 

100 

2 


2 

100 

6 

5 

1 

100 

0 


2 

100 

4 

6 

1 

100 

11 


2 

100 

12 

9 

1 

100 

5 


2 

100 

3 

10 

1 

100 

4 


2 

100 

1 

11 

1 

100 

11 


2 

100 

1 

12 

1 

100 

6 


2 

100 

4 

13 

1 

100 

2 


2 

100 

3 

16 

1 

100 

2 


2 

100 

3 

17 

1 

100 

2 


2 

100 

12 

18 

1 

100 

3 


a) Construct the initial p-chart. 

b) Is the process of invoice preparation under statistical control? Ex¬ 
plain. 

c) What must be done before a control chart may be prepared for 
future use? 

d) Set up the p-chart for future use. 

4. Draw an OC curve for the p-chart constructed in Problem 3 
for a single sample. For 2 samples. For 5 samples. (Assume that the 
sampling distribution of p is normal.) 

5. A company wishes to check on the reliability of laboratory 
inspection results. A standard sample of 4 items is submitted at ran¬ 
dom intervals for inspection (unknown to the laboratory techni- 
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cian). The technician determines the per cent of metal by weight in 
the 4 units of the compound submitted to him for analysis. The re¬ 
sults of the first 24 submissions of the same standard sample are 
shown below: 4 


Standard Sample Number 

Per Cent of Metal by Weight 

Item 1 

Item 2 

Item 3 

Item 4 

1 . 

10.28 

10.30 

10.25 

10.25 

2. 

10.26 

10.26 

10.19 

10.30 

3 . 

10.27 

10.28 

10.25 

10.26 

4 . 

10.28 

10.30 

10.28 

10.30 

5 . 

10.15 

10.28 

10.43 

10.20 

6 . 

10.13 

10.23 

10.29 

10.37 

7 . 

10.27 

10.39 

10.28 

10.29 

8 . 

10.20 

10.29 

10.12 

10.25 

9 . 

10.25 

10.33 

10.20 

10.24 

10 . 

10.25 

10.31 

10.20 

10.21 

11 . 

10.16 

10.20 

10.34 

9.97 

12 . 

9.88 

10.13 

9.97 

9.88 

13 . 

10.13 

10.25 

10.31 

10.25 

14 . 

10.39 

10.25 

10.31 

10.27 

15 . 

10.36 

10.24 

10.31 

10.26 

16 . 

10.26 

10.29 

10.32 

10.20 

17 . 

10.24 

10.27 

10.30 

10.34 

18 . 

10.39 

10.37 

10.21 

10.24 

19 . 

10.31 

10.28 

10.38 

10.23 

20 . 

10.33 

10.26 

10.18 

10.21 

21 . 

10.21 

10.14 

10.21 

10.24 

22 . 

10.14 

10.10 

10.22 

10.25 

23 . 

10.24 

10.12 

10.13 

10.10 

24. 

10.13 

10.18 

10.22 

10.19 







a) Why are both an x-chart and an E-chart needed in the present 
situation? 

b) Set up an 5-chart on the basis of the above data. 

c) Does the 5-chart indicate that the process is in statistical control? 
Explain. 

d) What must be done before an 5-chart may be prepared for future 
use? 

e) Set up an E-ehart on the basis of the above data. 

/) Does the E-chart indicate that the process is in statistical control? 
Explain. 

g) How does your interpretation of a point out of control on an x- 
chart differ from your interpretation of a point out of control on 
an E-chart? 

h ) Revise the 5-chart and E-chart for future use. 

4 Based on Frank W. Kroll, Jr. “Effective Quality Control Program for the 
Industrial Control Laboratory,” Industrial Quality Control, Vol. XIV, No. 6 (De¬ 
cember, 1957), pp. 9-13. 





























Oh. 11] 


THE CONTROL CHART 


367 


6. A company manufactures paper that is intended to be im¬ 
pervious to oils. It conducts pinhole tests at regular intervals. A 
sheet of paper, 17 by 22 inches in size, is taken from production, and 
colored ink is applied to one side of the sheet. Each ink spot which 
appears on the other side of the sheet within 5 minutes is counted as 
a defect (a pinhole). 

The results for the first 25 sheets sampled are shown below: 


Sheet Number 

Number of 
Pinholes 

Sheet Number 

-••• __ 

Number of 
Pinholes 

1 . 

9 

14 

K 

2 . 

8 

15 . 

4 

3. .. 

9 

16 

12 

4. ... .. 

12 

17 

A 

5. 

8 

18. . 

21 

6 . 

5 

19 

Q 

7. 

7 

20 . 

tf 

A 

8 . 

6 

21 

u 

q 

9. .. 

5 

22 

V 

90 

10 . 

9 

23 . 

*AJ 

IQ 

11 . 

6 

24. . . 

1«7 

Q 

12 . 

8 

25. . 

o 

7 

13. 

7 


# 


a) Construct a c -chart on the basis of the above data. 

b ) Is the process in statistical control? Explain. 

c) How does the control chart help the firm bring its process under 
control? 

d) Construct a c-chart for future use. 

7. A process for the manufacture of steel rods has recently been 
perfected and is operating in statistical control relative to the break¬ 
ing strength of the rods. The average breaking strength (process 
average) is 600 pounds per square inch (p.si.), and the process 
standard deviation is 25 p.s.i. 

In spite of the presence of statistical control, complaints have 
been received from customers concerning the great number of rods 
that have been breaking. 

What might be wrong? What might be done to remedy the situa¬ 
tion? In your answer emphasize what statistics can do and cannot 
do as an aid in coping with the problem. 

8. A company manufactures dial knobs. The specification for 
the inner diameter of the knob is .50 ± .03 inches. 

Control charts for the mean and range are set up to control the 
dimension of the inner diameter. The estimated process average, 




























368 


STATISTICS FOR BUSINESS DECISIONS [Ch. 11 

based on 25 samples of size 5, is .506 inches, and R is .052 inches. 
The process is found to be in statistical control. 

Is the process capable of producing knobs whose inner diameters 
meet the required specifications? Explain carefully. 

11.11 You Will Also Find That 

The construction of control charts and their use are discussed 
very clearly in: 

A. S. T. M. Manual on Quality Control of Materials. Philadelphia: 

American Society for Testing Materials, 1951. 

American Standard, Zl.S—1958, Control Chart Method for Controlling 
Quality during Production. New York: American Standards As¬ 
sociation, Inc., 1958. 

Grant, Eugene L. Statistical Quality Control, chaps, i, ii, iv, v, vii, x, 
and xi. 2d ed. New York: McGraw-Hill Book Co., Inc., 1952. 

For further discussions you may consult: 

Cowden, Dudley, J. Statistical Methods in Quality Control, chaps, xvi- 
xix. Englewood Cliffs, N.J.: Prentice-Hall, Inc., 1957. 

Duncan, Acheson J. Quality Control and Industrial Statistics, chaps, 
xviii-xxii. Bev. ed. Homewood, Ill.: Bichard D. Irwin, Inc., 1959. 

For varied applications of control chart techniques you should 
consult Industrial Quality Control, a monthly publication of the 
American Society for Quality Control. The transactions of the So¬ 
ciety’s annual conventions are also replete with illustrative mate¬ 
rials. The articles in these publications are, as a rule, nontechnical. 



CHAPTER ■ 12 


Comparative Experiments and 
Their Statistical Design 


12.1 The Nature of Experimentation 

In this chapter we shall consider some of the statistical methods 
and theory that are required to reach decisions on the basis of com¬ 
parative experiments. 

An experiment may be regarded as a special type of statistical 
study in the sense that it involves collecting information as a basis 
for action. In the statistical studies that we have previously con¬ 
sidered, observations usually are available and would be available 
whether or not a statistical study was conducted. In contrast, an ex¬ 
periment ordinarily requires that observations be created—that is, 
elementary units must be subjected to certain stimuli or influences 
and their reactions observed. Hence, an experiment usually involves 
observations on a situation created by the study itself. 

To point up this distinction between experiments and other types 
of statistical studies, contrast a study to estimate the average rent 
paid for a group of dwelling units with a study to measure the effi¬ 
cacy of a new drug. In the first situation, the observations on rent 
would be available whether or not a study were conducted. How¬ 
ever, in the second, elementary units (such as guinea pigs) must be 
subjected to the drug in order to determine its influence and, hence, 
its efficacy. J 

Many problems requiring experimentation can be decided 
through the medium of elementary statistical theory such as we 
discussed in Chapters 7 and 8. Thus, a decision problem on whether 
or not to market a synthetic plastic might require an experimeht to 
measure its average breaking strength. This situation, however, 
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simply presents a testing problem for which n is the decision-pa¬ 
rameter. Similarly, the estimation of the percentage of ammonia in 
a gas may require some experimentation, but the problem simply 
presents an estimation problem for which is the decision-param¬ 
eter. 

However, some experiments present the need for more advanced 
theory and methods. In particular, comparative experiments re¬ 
quire more complex statistical designs. A comparative experiment 
involves experimentation as a basis for comparing properties of two 
or more populations. For example, consider the experiment con¬ 
ducted a few years ago to decide on whether or not to approve the 
general use of Salk vaccine as a preventive of infantile paralysis. In 
that instance, one group of children (elementary units) was given 
Salk vaccine, while another group was not given the vaccine. The 
problem required that one population (the effects of the vaccine) 
be compared with another population (the effects of no vaccine). 
Of course, only a sample from both populations could be observed 
—the differences shown by the samples of effects served as a basis 
for decision. 

In this chapter, then, we shall consider some aspects of the statis¬ 
tical design of comparative experiments, with particular emphasis 
on applications in business. 

Some persons may associate experiments with chemistry and test 
tubes, but actually experimentation covers a far wider range of ap¬ 
plication both in and out of the business world. Thus, pharmaceu¬ 
tical, chemical, and oil companies have a natural interest in labora¬ 
tory experiments. But, they and many other types of companies also 
have an interest in such things as marketing and advertising experi¬ 
ments and in industrial experiments where the laboratory may be 
the factory or the consumer's home or the market place. Why? Im¬ 
portant decisions must be based on the results of such experiments. 
Let us consider some illustrations of business decision problems 
which require comparative experiments. 

Example 12.1 A company develops a new product—a coated 
magnetic wire. Two methods of coating the wire (that is, two proc¬ 
esses) are available for use in production. Since both methods are 
equally costly, the main criterion for a decision is an answer to the 
question: “Which method will result in the more uniform product?” 
An experiment to compare the results of applying method A. and of 
applying method B would provide a basis for the answer to this ques¬ 
tion and, hence, a basis for decision. 

Example 12.2 An advertising agency must decide which of sev¬ 
eral promotional displays is more effective. As a basis for deciding 
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which display is more effective and, hence, which should be used an 
expenment is to be conducted. Samples of’consumers are to be s£wn 
* :lf* Spla J s - the reaction ,3 of the consumers measured through inter- 

vTCs" ° ° 1Ce ° f 8 diSPky made ° n the basis thes" obser. 

upw T m? i fo A P ersonnel director is responsible for developing a 
new employee training program. The program is to include solie 

nWMu “ s ,* ru ? tl0n ’ and two methods of presenting the material are 
1 u dlrector thus faoes a decision problem—the choice of 
a “ eth t °J mstruction. One decision procedure is suggested-use 
method of instruction on a sample of employees and on the ba 

Za J > “* «wfeh 

method of instruction is the more effective. 

The statistical design of an experiment is a plan for collecting 

obser vatlons . The gtatigtical deg . gn ()f ^ 

nients is important m order to control, within economic limits the 
risks of mcorrect decisions. A good part of this chapter, therefore 
will deal with the consideration of alternative designs. 

Throughout this text we have emphasized the importance of a 
prion planning. In short, one must always remember that statistical 
problems do not begin only after the data are collected. Unfortu¬ 
nately, one sometimes finds that an experimenter collects data and 
then worries about their analysis. However, not even an expert stat 
istiman wouM be clever enough to analyze data collected haphaz¬ 
ardly. It is important, therefore, that the same type of a priori plan¬ 
ning stressed throughout our study of statistics be applied in the de- 

Ser! Sr" ^ "" 6mment BrftiSh * *»£ 

boot n appr °fi ate t0 qUOte Fisher 0n this to P ic because his 
book The Design of Experiments, which was first published in 1935 

as become the basis for the development of the subject. Originally 
applied to the field of agriculture, the principles of experimental de¬ 
sign m recent years have been extended for use in the natural sct 

dustry m 16 S0Clal SClenCeS) and in busin ess, commerce, and in- 


1 Ronald A. Fisher, The Design 
Oliver & Boyd, 1949), p. 12 . 


of Experiments 


(5th ed.; Edinburgh, London: 
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12.2 Decisions Involving the Comparison of Two Population 
Means . 

A common type of decision problem calls for the comparison o 
two populations because the consequences of taking an action d - 
nend upon the difference between the means of the populations. 

P Let us denote such populations as A and B, their arithmetic 
means as m, and p b , and their standard deviations as <r A and «■*, 
Respectively" The difference between the two population means 
may be called D. Hence, 

D = ha — f*B - 

We shall consider the type of problem for which the consequences 
of taking an action depend upon the true, but ordinarily unknow , 
value of D. Hence, we may say that D describes states of the wor d, 
and that the true value of D describes the true state of the world. 
We assume, then, that if this true difference were known action 
could be taken. Let’s look at an illustration for future, as well as im¬ 
mediate, reference. 

Example 12.4 A manufacturer of fishing 
nr, the use of one of two methods of waterproofing fishhnes. Thus, the 
alternatives Re! (1) use method A, and (2) use method B. The po^ 
SeTntorrecTdecilns are: (1) to use method A when actually B 
i u t j o-nH to use B when actually A should be used. 

»». *« -t Lw« y 

breaking strength of the line after its apphcation. Of course, the ef¬ 
fect will vary somewhat from fishlme to fishlme. 

gtc l.“ populeti... n» be 0»* M»“»" Z“.te 

the breaking strengths of fishlines treated by method A, the othe 
+ thp breaking strengths of fishlines treated by method B. 
since this problem, like almost all experiments, is analytical because 
ft calls for action on a process, both populations are infinite 
It is believed that the variability in breaking strengths 
from the use of method A would be about same as the ™mbfirty 
n breaking strengths resulting from the use of method Eh (That is t 
is believed that t A equals „».) However, there ,s uncertainty about 
which method results in the higher average breaking strength. T 
is, there is uncertainty about the value ot 

D = ha — hb • 

Some information about this value is necessary in orde "' *’ “^ide 
rational decision. Hence, an experiment is to be conducted to provide 
sample information about this difference. 

Before we attempt to formulate a statistical decision rule for this 
problem, let us consider the general nature of such an. „ 

A sample of the effects of method A and a sample of the effects 
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method B are to be observed. The average breaking strength of the 
fishlines treated by each method then may be computed. Let us de~ 
note the possible results of this experiment as follows: 



Sample A 

. 

Sample B 

Sample size. 

n A 

x A 

S A 2 

-- --- 

n B 

Xb 

SB 2 

Sample mean. 

Sample variance. 



The pertinent statistic is the difference between the two sample 
means, namely, 

d = x A — x B . 

This statistic provides partial information about the true state of 
the world about the true but unknown value of D. The choice of 
the method of treating fishlines may be related to possible values 
of a. However, this involves some risks of incorrect action because d 
represents only sample information. To measure these risks we 
therefore shall consider its sampling distribution. 

,.J lrst conslder this general idea: the sampling distribution of the 
difference m sample means, for sufficiently large samples, will be ap¬ 
proximately normally distributed. In addition, remember that the 
mean of this distribution, E(d), is 

E(d) = D . 

The standard deviation of the statistic d may be denoted as <r« The 
formula for this standard error can be derived by the application of 
a fundamental theorem of mathematical statistics: The variance of 
the difference of two independent quantities (such as x A and x B ) is 
equal to the sum of the variances of these quantities. Hence, : 

«•/ = al + **! . 

We know that the variance of a sample mean for a simple random 
sample from an infinite population is v/n. Therefore, 


, Ka n B 

and 


<?d 



In many experiments it is reasonable to assume that equals <r B , 
if so, we may denote their common value by o- and write 
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This assumption of equal variances will be used m the fishhne 
example that we are considering. Therefore, the last relationship 
given for <r 4 may be used. 

Example 12.5 Suppose that we consider a numerical example of 
the sampling distribution of d in the fishline problem. Let us assume 
that 50 pieces of fishline are to be treated by method A and that 
another 50 pieces are to be treated by method B Their breaking 
strengths are then to be determined. The difference between the two 
sample means should be approximately normally distributed with 
E(d) = D and 


<Td 


\ n. 


i%a 


Hence if the true difference between [x A and n B were 3 pounds and 
if the standard deviation of each population were 2 pounds, the sam¬ 
pling distribution of d would appear as follows: 



Its mean would be 

E(d) = D = 3 , 

while its standard deviation would be 

* = + ^ 

= 2 ^50 + 50 

= .40. 

Let us now apply the foregoing theory to the development of a 
statistical decision rule for the fishline problem. Such a rule must 
provide the answers to the following four questions: 

1 How should the sample be selected? , , 

2 How should the sample be allocated? That is, how many lengths 
' of fishline should be treated by method A and how many by 

method B? 
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3. What sample size is required? 

4. For what values of d should method A be adopted and for what 
values of d should method B be adopted? In short, what is the 
critical value, d*? 

The answer to the first question is the easiest. The sample, what¬ 
ever its size, should be random. That is, the lengths of fishline must 
be chosen and then apportioned between the two methods by the 
use of random numbers. Some form of randomization is a require¬ 
ment of every statistical study. Without randomization there is no 
basis for the application of statistical theory. 

The second question calls for an answer to the question of allo¬ 
cating the total sample between the two methods. A general answer 
to this question for comparative experiments is as follows: If the 
cost of applying each method is equal and if the two population 
standard deviations are equal, assign half of the total sample size to 
each treatment. This allocation minimizes the standard error of d 
and, hence, minimizes the risks of relying on sample information. 
To illustrate this thought, reconsider the formula for the standard 
error of d when <t a = <r B , 

11 ,T 

The value of <r 4 obviously depends upon the factor 


\j - 1 -, 

yn A n B 

and <r d is minimized when this factor has its lowest value. This 
occurs when n A = ns. 

Example 12.6 Assume that 100 lengths of fishline are to be water¬ 
proofed. Let us consider different values for n A and n B and compute 
the corresponding values of 


/— + - 
\ n A ~ l " n B ' 

n A 

n B 

J- + - 



\jn A n B 

10 

90 

.333 

20 

80 

.250 

30 

70 

.218 

40 

60 

.204 

50 

50 

.200 

60 

40 

.204 

70 

30 

.218 

80 

20 

.250 

90 

10 

.333 
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We thus observe that n A = n B = 50 results in the minimum value of 



and, hence, the minimum value of <r d . 

The way in which we answer the third and fourth questions de¬ 
pends on the objectives of the decision maker. Thus, if the costs of 
applying the treatments were high, one might wish to set the total 
sample size, n, directly. Then, the risk of one type of incorrect deci¬ 
sion could be set in order to find d*. Alternately, one might wish to 
control the risks of both types of incorrect decisions. In this case n 
and d* can both be found from the appropriate equations (as in 

Chapter 8). . 

Here, let us assume that n is set directly. Thus, in the fishlme 
problem suppose that it is decided to select a total sample of 

100_50 lines to be treated by each method. To find the critical 

value, one risk of an incorrect decision must first be set. That is, ei¬ 
ther of the following must be set: 

P (Using Method A if Method B Better) 

OR 

P (Using Method B if Method A Better) 

Moreover, one must describe what is meant by “method B better” 
or “method A better” in terms of a value of D. Thus, suppose that 
the company feels that method A would be more economical if the 
average breaking strength resulting from its use exceeds the average 
resulting from the use of method B by 1 pound (i.e., if D 1). 
Further, suppose that they are willing to take only a .01 risk of 
using B if D = 1. 

Consider, now, possible values of d. We wish to find a critical 
value, d*. For values of d equal to or greater than d*, method A will 
be used, while for values of d less than d*, method B will be used. 
We have a one-tailed decision rule of the form: 


Use Method B 

Use Method A 




d* 

Values of d 

Next let us consider the sampling distribution of d if D = 1, keep¬ 
ing in mind the objectives: the sample size, 

n — 100 , tia — Kb = & 0 > 
and the specified risk, 

P (Using Method B if D = 1) = .01 . 
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If D equals 1, the sampling distribution of d would be normal 
with its mean, E(d) equal to 1. Furthermore, d* must cut off the 
lower tail of the sampling distribution of d so that only a .01 area is 
to the left of d*. Graphically, the situation is as follows: 



Because of the tail area of .01, z equals 2.33. Hence, we have 

d* = E(d) - 2.33 (T d 
= 1 - 2.33<r d . 

Unfortunately, we do not know the value of cr d go we cannot im¬ 
mediately compute the critical value. However, once the sample is 
drawn, <r d can be estimated from the sample and the value of d* 
calculated. Thus, after a sample is drawn, an estimate of <r d 2 can be 
computed from the following formula: 

£/ — UaSa2 + n B 8 n V 1 , _1\ 

tia + Ub — 2 ns) f 

where, as noted, s/ and sb 2 are the two sample variances. Of course, 
an estimate of o- d is provided by the square root of the above. 

Example 12.7 The experiment to provide a basis for deciding on 
a method of waterproofing is conducted. The decision rule is: Treat 
a random sample of 50 pieces of fishline by each method. Observe the 
average breaking strength in each sample and calculate the differ¬ 
ence, 

d — x A — x B . 

If d is less than d*, use method B; otherwise use method A. The 
critical value determined from the sample will be 

d* = 1 - 2.33<r d . 

The results are summarized below: 

Method A Method B 

77a = 50 71b — 50 
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Thus, we observe that 

d = 15.9 - 15.2 
= .7 pounds. 

In order to translate this result into a course of action, we must 
compute the value of d* } 

d* = 1 - 2.33crd . 

Thus, first we estimate a d : 

A 2 ^ n A s A 2 + n B s B V _1_ , _1_\ 
d n A T* s ~~ 2\n A tibJ 

50(.115) + 50(.128)/ 1 , 

__ 98 \50 ^ 50/ 

= .00496 

^ * V;00496 
- .070 . 

Next we compute d*: 

d* = 1 - 2.33 (.070) 

= 1 - .16 
= .84 pounds. 

Lastly we observe that d equals .7 pounds and that this is less than 
the critical value, d* — .84 pounds. Hence, the decision is to use 
method B. 

Let us summarize our discussion in this section. In general, one 
way of formulating a statistical decision rule when D is the appro¬ 
priate decision-parameter is as follows: Decide on the sample size 
directly. Focus on one value of D (Do), and specify a tolerable risk 
of acting incorrectly if D were true. The critical value is then 

d* = Do + Z(T d j 

w^here z is the normal deviate required to yield the specified risk, 
and where may be estimated from the sample. 

Three concluding remarks are in order. First of all, the procedure 
sketched in this section is applicable only under the assumption 
that & A = <tb . If this assumption should not be made, the above 
procedure must be modified. Secondly, the procedure also rests on 
the assumption that the sampling distribution of d is normal. 2 This 
assumption is reasonable if the combined sample size is sufficiently 
large—as a general rule, if n is at least 25 or 30. If n is small, the 
sampling distribution of the pertinent statistic ordinarily cannot be 
assumed normal and special techniques must be incorporated into 
the statistical decision procedure. 

2 Strictly speaking, the assumption is that d/<ra is normally distributed because 
we estimate && from the sample. 
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Lastly, remember that the methods sketched require that n be 
set directly and that only the risk of an incorrect decision for one 
possible state of the world (one value of D) be controlled. There¬ 
fore, usually it is important to look at the operating characteristic 
curve of the rule adopted. This OC curve would show risks for other 
values of D and indicate whether the rule should be modified. 
While, at times, special problems may arise in the construction of an 
OC curve for this type of testing procedure, the general method of 
calculation is similar to that illustrated in Chapter 8. 

12.3 Decisions Involving the Comparison of Two Population 
Proportions 

Sometimes comparative experiments are necessary to provide 
partial information, not on the difference between two population 
means but on the difference between two population proportions. 

Example 12.8 The industrial engineer of a company must design 
a punch-press operation to stamp out inlays from sheet metal. He 
considers two alternative methods which differ only in the way the 
metal is oiled. Method B is slightly more time consuming than 
method A. However, the engineer is uncertain about the proportion 
of defectives to be expected from each method. He, therefore, wishes 
to conduct a comparative experiment to provide a basis for decision. 

Let us denote the two population proportions of interest in this 
type of problem by and w B . Thus, in the punch-press problem, 
ma y be defined as the proportion of defectives that would be 
produced by method A if that operation were repeated indefinitely 
under similar conditions. w B is defined similarly for method B. 
Furthermore, let 

D = 7T A — TT B . 

In Example 12.8, D describes states of the world for the decision 
problem relating to the two methods of operating the punch press. 
Thus, we shall consider 


d — Pa — Pb 

as the pertinent statistic to be derived from an experiment com¬ 
paring the two methods. The statistics, p A and p B , are the propor¬ 
tions of defectives in the two random samples taken from the two 
methods of operation. 

Hence, consider the sampling distribution of d. For sufficiently 
large samples, the distribution will be approximately normal with 
mean equal to 


E(d) — 7TA — TT B = D 
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and variance equal to 

2 — 7Ta(1 ~ 7Ta) , TTb( 1 ~ TTb) # 

^ ~~ 7lA U B 

The latter result follows from the same theory applied to derive 
the variance of (x A ~~ %b) the last section. 

Now, let us develop a statistical decision rule for the type of prob¬ 
lem under consideration. As in the last section, we shall treat the 
case for which sample size is determined directly and one risk of an 
incorrect decision is specified. 

Assume that the engineer in Example 12.8 wishes to spend only 
about one hour of worker and machine time on the experiment. He 
estimates that in that time he can get a sample of 5,000 inlays from 
method A and a sample of 4,800 inlays from method B. Workers are 
to be instructed to punch that many inlays. (Note that here the 
cost, in time, of sampling is not equal and a larger sample from the 
cheaper or faster method is called for.) The engineer feels that be¬ 
cause method B is the slower, if D — 0 he wishes only a .01 risk of 
adopting method B. 

We must now find a critical value, d*, which divides all possible 
results of the experiment—all possible values of d —into two groups 
each of which leads to a different action: 


Use Method A 

Use Method B 




d* 

Values of d 


Thus, values of d less than d* call for the use of method A, while 
values of d equal to or greater than d* dictate the use of method B. 

The engineer indirectly has specified this critical value by indi¬ 
cating that if D equaled 0, the tolerable risk of using method B 
should be only .01. If D is 0, the sampling distribution of d appears 
as follows: 
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Its mean, of course, is E(d) = 0 . Its variance is 

< r / = ~ “*a) ^ 1 — tb) 

71b 

If D = 0, then tt a is equal to ir B) and we may write 

vi = 7r(l — . 

\n A n B J 

The critical value, d*, as can be seen, is 

d* = 0 + 2.33<r d 
= 2 . 33 ad . 

This cannot be immediately computed because cr* depends upon 
the unknown value of tt (= tt A = by assumption). However, 
this value can be estimated from the sample, and d* can be com¬ 
puted at that time. An estimate of 7 r from the samples is provided 
by the proportion of defectives in both samples, p. Hence, we may 
use 

< 7 / = p( 1 - p)(^~ + - 1 -) 

as an estimator for cr d . 

Example 12.9 Let us consider the application of the decision rule 
developed m this section. Suppose that the engineer has 5,000 inlays 
stamped by method A and 4,800 inlays stamped by method B and 
that the results are as follows: 


Method 


Sample 

Number of 

Proportion 

Size 

Defectives 

Defective 

5,000 

255 

.0510 

4,800 

216 

.0450 

9,800 

471 

.0481 

Pa - p B 




Total. 

We now note that 


= .051 - .045 
== .006 . 

Furthermore, since p equals .0481 (the proportion of defectives in 
both samples), we find 

of = p(l - v)(— + —) 

\n A n B J 

= (. 0481 )(. 9519 ) f — 

' \ 5,000 T 4 . 800 / 

= .00001868 


= V .00001868 

= .0043 
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Thus, we find the critical value is 


d* = 2 . 33 ^ 

= 2.33(.0043) 

= .010 . 

Since the value of d is .006 and this is less than the critical value, 
d* = .010, our decision rule calls for the use of method A. 

Our brief discussion of the statistical decision procedure for com¬ 
paring two population proportions should indicate the general na¬ 
ture of such a procedure. However, you should recognize that we 
have treated herewith only the simplest situation—the formulation 
of a rule with n set directly and one risk set for D equal to zero. 
Technical complications, beyond the scope of this book, arise when 
objectives other than these are set. However, the basic ideas remain 
the same. 

12.4 Decisions Involving’ the Comparison of Two Populations 
—Design 

Let us further analyze the illustration discussed in the last sec¬ 
tion. The problem involved the collection of two samples—one sam¬ 
ple of 5,000 inlays produced under one method of oiling the metal 
for a punch-press operation and another sample of 4,800 inlays pro¬ 
duced under a second method of oiling the metal. In making these 
observations as a basis for decision, the industrial engineer must be 
careful to insure that both samples are random samples and that 
both samples are independently drawn. The theory for comparing 
two population proportions presented in the last section rests on 
these assumptions, and so does the theory for comparing two popu¬ 
lation means presented in Section 12.2. 

Because it requires total randomization, the experimental design 
that we have considered to this point in our discussion is called a 
completely randomized experiment. An alternative experimental 
design which may have greater efficiency (that is, be less risky) than 
a completely randomized experiment is called a matched samples 
1 or a paired samples design. 

Example 12.10 A garment manufacturer is considering the pur¬ 
chase of new sewing machines to expedite the manufacture of dresses. 
The decision on whether or not to buy the new machines is to be 
made by a comparison of the average times taken to sew the seams 
on a garment. 

An experiment is to be designed to provide a basis for decision. 
One possible design would be a completely randomized experiment. 
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However, suppose that it were known that the choice of operators 
n * e m ^ ch . mes would influence the results. Then the experiment 
should be designed m such a way as to control this source of variation 
It it were not, the decision procedure would be too risky. For ex-’ 
ample, suppose that experienced operators were inadvertently used 
r th the old machines, while apprentice operators were used 
with the new machines. Would a smaller average time on the old 
machines indicate that they were better than the new ones? Not 
necessarily. Variation in results due to differences in workers'must 
be controlled, and this may be accomplished through the use of a 
matched samples design. 

•+I hUS ’ i suppos f. ^ at . a matched sample design were adopted 
with workers as the basis for matching because they are a potential 
source of variation m the results. In such an experiment, each worker 
in the sample would be observed once on the new machine and once 
the old machine. Thus, if 25 workers were used in an experiment 
25 pairs of observations would be made available. One possible set of 
results from such an experiment follows- F 


(Time Taken to Sew Seams—in Seconds) 



Old Machine 

New Machine 

Difference 

1 . 

ooq 

190 

191 

230 

232 

229 

226 

411 

355 

226 

223 

325 

301 

192 


2 . 

oic 

33 

3. 


27 

4. 

94.Q 

33 

5. 

9fi7 

17 

6 . 

9fi1 

38 

7. 

47 r: 

35 

8 . 

3ft9 

64 

9. 

OAO 

27 

10 . 

OKO 

36 

11 . 

qkk 

35 

12 . 

314 

224 

oon 

30 

13. 

13 

14. 

32 

15. 

269 

984. 

202 

240 

243 

216 

223 

247 

285 

200 

23 

16. 

29 

17 . 

18 . 

19. 

A Otc 

249 

256 

271 

814. 

41 

33 

33 

20 . 

24 

21 . 

OIt 

911 

29 

22 . 

All 

9ft9 

11 

23. 

AOA 

qni 

250 

278 

280 

360 

32 

24. 

OOl 

9QK 

23 

25. 

AoO 

41 A 

15 


ilU 

50 

Average. 

284.72 

254.20 | 

30.52 

•- —— - —— 


You will notice that the difference between the two means is 
284.72 - 254.20 = 30.52 

and that this equals the average of the differences in the last column 
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Before carrying out the matched samples experiment illustrated 
above, it would be necessary to formulate a statistical decision 
rule, and this, in turn, requires that objectives in terms of tolerable 
risks be specified. Thus, let us assume that the sample size for the 
afore-mentioned decision problem was set directly at 25 observa¬ 
tions per machine. In addition, suppose that an analysis of the 
pertinent cost data indicates that if, on the average, the new ma¬ 
chines are 35 seconds quicker than the old machines, only a .01 risk 
of not buying the new machines should be tolerated. Let be the 
average time, in seconds, necessary to sew the seams on the old 
machines and w the average time on the new machines. Further- 

more, let 

D = jUO — i*n * 

It has been specified that if D = 35, the risk of retaining the old 

machines must be only .01. , 

Since n has been set directly, only the critical value, d , need 
be determined for the statistical decision rule. This value should 

divide values of 

d — x o — xn 

into two regions, as follows: 


Retain Old Machines 

Buy New Machines i 


i 


d* 


Values of d 

To find the value of d* we must consider the sampling distribu- 
tion of d for the type of experimental design and the type of 
sampling being utilized. This sampling distribution is for suffi¬ 
ciently large samples, approximately normal. Its mean, E(d) equals 
D. Its standard deviation—the standard error of d, namely v„— 
while ordinarily unknown a priori can be estimated from a sample 
by the following formula: 


= 


s(d> - dy 

n(n — 1) 


(t = b 2, 


, n) , 


where the values of dt represent individual differences in the 

sample. . . , , ,, 

Let us apply this theory to find the critical value for the sew¬ 
ing machine decision problem. If D = 35, the sampling distribution 
of d would appear as follows: 
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You will notice that a tail area of .01—corresponding to the speci¬ 
fied risk of retaining the old machines if D = 35—has been shaded. 
Also notice that this implies d* is 2.33 standard errors less than 
the value of E(d )—that 


d* = E{d) - 2.33 a d 
= 35 - 2.33cr d . 

This value may be computed after the sample is drawn and <r d is 
estimated. For a value of d less than d*, the decision should be to 
retain the old machines. For a value of d equal to or greater than 
d , the decision should be to purchase the new machines. 

Example 1211 Reconsider the data given in Example 12.10. The 
value of the pertinent statistic, d, is 


30.52 


To 




compare this to d * we must first estimate <r A . Hence, note that 

-V. / 7 T\ O 


2 (& - d) 
n(n — 1) 


(*' « 1, 2, . . . , n) 


1 


25 X 24 
_ 3,062.24 
600 
= 5.10. 


[(33 - 30.52)2 + (27 _ 30.52)2 + . . . + (50 - 30.52)2] 


Therefore, 


and 


v* = V5T0 = 2.26 


d* = 35 - 2.33cr d 
= 35 - 2.33(2.26) 

= 29.73 . 

^i nce d equals 30.25 and, hence, is greater than the critical value, 
d , the decision should be to buy the new machines. 
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The utility of the matched samples design which we have briefly 
discussed in this section is that it generally results in smaller risks, 
for the same amount of experimentation, than does a completely 
randomized experiment. (Alternately we might say that generally 
the matched samples design requires less experimentation to 
achieve the same reduction in risks.) 

Any gain of the matched samples design arises from the grouping 
employed. In our sewing machine illustration, the observations were 
grouped by workers since it was logical to assume that times of oper¬ 
ation differed from worker to worker. Grouping on some other bases 
might yield even better results. In any event, the principle to follow 
is to use groupings within which observations are likely to be less 
variable than within the population as a whole. (With this principle 
in mind it is worthwhile for a student to compare the matched sam¬ 
ples design with stratified sampling—both rest on essentially the 
same basic idea.) 

12.5 Decisions Involving the Comparison of Several Popula¬ 
tions 

Many statistical decision problems require experiments to com¬ 
pare not only two but several populations with respect to pertinent 
parameters. There are many different problems which have this gen¬ 
eral requirement, and each of these calls for its own particular type 
of statistical experiment. While a study of most of these is beyond 
the scope of this textbook, we shall consider the most common type 
of problem. This is one for which one must decide whether or not 
the populations under consideration have the same arithmetic 
means. 

Example 12,12 A company’s operations and analysis group has, 
among its duties, the job of rating employees. These ratings are used 
to check on the accuracy of ratings made by departmental foremen. 

There are several efficiency analysts who rate employees in the 
group. In designing the procedure to check ratings (assigning ana¬ 
lysts to departments, and so forth) the supervisor wishes to know 
whether or not there is any significant variation among his ana¬ 
lysts in assigning ratings. If there is not, he need not worry about 
which analyst to assign to which department; if they do differ, he 
must take this into account in designing the rating procedure. 

The last example illustrates a very common type of statistical 
problem. Let us define it statistically. Several populations are to be 
compared. Each population represents the totality of ratings that 
could be assigned by an analyst. (These populations are infinite 
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since the decision problem centers on the rating processes.) The 
problem is to determine whether or not the means of the popula¬ 
tions are equal; for example, to decide whether or not the average 
ratings that would be assigned by analysts (in many, many trials) 
are equal. If they are, one course of action is called for; if they differ 
considerably, another course of action is appropriate. In short we 
have the following: ’ 


Alternative 

State of the World 

£ 

“ 

TR 

II 

TS 

II 

Ml M2 T& M3 5^ . . , 

Assume analysts differ 

Type I error 
Correct decision 

Correct decision 
Type II error 

Assume analysts similar... 

•- — -—-_ 


The simplest design for a statistical study of this type of problem 
would require that a simple random sample of ratings be observed 
for each analyst. Thus, the supervisor might select a sample of 5 em¬ 
ployees for analyst A to rate, another sample of 5 employees for ana- 
lyst B to rate, and so forth. Let us assume there are 6 analysts. Then, 
6 independently drawn samples of 5 employee ratings would be ob¬ 
served. In general, this statistical design would require k independ¬ 
ently drawn simple random samples of size n. 

Alternatives to this simple design will be considered in the next 
section. Here let us see how sample observations drawn in the man¬ 
ner sketched can be analyzed to provide a basis for decision. For rea¬ 
sons that will become obvious once one understands the methods 
employed, this method of statistical analysis is called the analysis of 
variance. 

As an aid in discussing the analysis of variance, let us regard one 
possible outcome of the experiment designed above. Thus, consider 
the following data which represent 6 samples of 5 employee ratings. 
The ratings are on an efficiency scale which ranges from 0 to 50. 


RATINGS OF EMPLOYEES BY SIX ANALYSTS 


A 

B 

C 

D 

E 

F 

42 

43 

38 

44 

32 

30 

38 

42 

28 

44 

42 

43 

27 

45 

33 

38 

38 

40 

49 

38 

40 

41 

29 

25 

44 

44 

42 

27 

30 

44 

40.0 

42.4 

36.2 

38.8 

34.2 

36.4 


To analyze data such as these by the standard analysis of variance 
techniques, one must make the following assumptions: 
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1. The samples are simple random samples, 

2. The samples are independent, 

3. The populations are normal, and 

4. The populations all have the same variance. 

If any of these assumptions cannot be realistically made the stand¬ 
ard techniques that we shall consider must be modified. Recall now, 
that the purpose of applying the analysis of variance is to decide 
whether or not the several population means are equal. Under the 
standard assumptions of normality and equal variances, this is 
equivalent to the decision on whether or not the several populations 
are identical. (If several normal populations all have the same vari¬ 
ance, they can only differ if their means diffei.) 

The analysis of variance technique rests on a three-step pro- 

Ced l U1 The variation within each of the several samples is calculated 
and averaged over all samples. This measure reflects variation m the 

2. The variation between sample means is calculated. This meas¬ 
ure reflects variation in the populations and variation (if any exists) 

between the means of the populations. . 

3. The two measures of variation are compared to determine 
whether or not any variation between the means of the populations 

is indicated. . , ,, , 

Let’s consider each of these steps m detail. First, note that we 

can measure the variability within each sample by the ample 
variance s 2 For the ith sample this shall be denoted by s 4 . Thus, 
the population variance for each population may be estimated as 


<ri 


.2 — 


n — 1 


However, since we assume in analysis of variance that the varia¬ 
bility is the same for each population, we can obtain a better esti¬ 
mate of the population variance by averaging all of the values oi 
6? In general, with k samples we average the k values oi <r 4 li 
we denote this estimated population variance determined on the 
basis of variability within the samples by we have 


2 

k 


(i = 1, 2, . . • , k) . 


Example 12.13 Let us compute cr w for 
above! First, recall that the observations 
38, 27, 49, and 44 and that their mean 


the employee ratings given 
in the first sample are 42, 
is 40. Hence, 
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*• = (jg^jO) 2 + (38 - IQ? + . . . + ftu - -mv 


o-i = -- S 

n — 1 


5 274 

4 ' 


Thu^welave: 1 ' th ° ° th6r ““P* 68 We alS ° c “ find esti “^te S of 


Analyst 

A. .., 

B. . . 

C. . . 

D. . . 

E. . . 

F. . . 


The value of & K 2 is the average of these 6 values, or 

<rj = 43.37 . 

This is an estimate of the variation in the population nf ratine t 
each analyst, based on the variability wifflSSf^pTf® 

Now, consider the variation between sample menns. A measure 

~ 

An estimator of a.; is : 

^.2 = _ A _ - x ) 2 

z k- 1 ~l (f = l, 2, •••,*) 

where x is the over-all mean of the k samples. 

Let s apply this estimator to our ratings illustration. 

Example 12.14 The 6 sample means given above are: 

40.0, 42.4, 36.2, 38.8, 34.2, 36.4 . 

The over-all mean is 38.0. Hence we shall estimate o-/ as 

W = ? • Mg^3g) j+ (42.4 - 38) 2 + . . . + (36.4 - 38)’ 

5 6 

= §. 44,24 

5 ' 6 

= 8.848 . 
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The measure computed in the last illustration reflects variation 
in the populations and variation (if any exists) be^cmi the means 
of the populations. Suppose, for the time being, that the hypoth- 
esis that the means are all equal is true. If so *. should reflect 
only population variation. In fact, the 6 sample means could be 
regarded as coming from the same population. In this case we know 


= n 


Since n 


— 5 for our problem, we have 




or 


5 X a? = 0 s 


Hence if all population means are equal, we may estimate <r on 
the basis of the variation between sample means as follows: 

= 5dx 2 

= 5 X 8.848 
= 44.24 , 

where is an estimate of * 2 based on the variation between sam- 

^The last step in our analysis is to compare and ^ by taking 
the ratio of the latter to the former. Thus, for our illustration the 

variance ratio is 

& b 2 __ 44.24 
&w 2 43.37 

- 1.02 . 

If all population means are equal, this ratio should, on the aver¬ 
age be approximately equal to one (as it is here). However, if they 
differ, <t> 2 should be larger than 0 2 on the average 
than 4„ 2 on the average. In this case we should expect to find the 
variance ratio significantly greater than one. 

The variance ratio, as our discussion has indicated, is the pe 
nent statistic upon which the choice of a course of action should 
rest. High values indicate the means differ; values m the neighbor¬ 
hood of one or lower indicate they do not. 

To formulate a statistical decision rule for t e type o ^ 
considered in this section requires some consideration of the sa¬ 
pling distribution of the variance ratio. This distribution is no • 
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mal. It follows the F distribution, a sampling distribution that is not 
considered in this textbook. Thus, here we cannot find for given lev¬ 
els of risk the critical value of the variance ratio and, hence, we can¬ 
not further discuss the formulation of a decision rule. The purpose 
of the preceeding discussion has been to discuss the logic of the anal- 
ysis of variance. 

In conclusion, we may note that a study to compare several popu- 
ations may involve sampling from each and every population or it 
may involve sampling from a sample of populations. For example, 
m the analysts-ratings illustration discussed in this section, we have 
assumed that a sample of ratings is observed for each and every 
analyst that every population being compared is sampled. This is 
called a fixed model situation by statisticians. If, however, there 
were many analysts and a sample of ratings had been observed for 
only a sample of analysts, the problem would have presented & ran¬ 
dom model situation. This situation involves the comparison of sev¬ 
eral populations on the basis of only a sample of the populations. 

12.6 Decisions Involving the Comparison of Several Popula- 
tions—Design 

. In the last section we considered the simplest design of a statis¬ 
tical study to compare several populations. In short, this design re¬ 
quired k simple random samples of size n to be observed independ¬ 
ently. At times a more efficient design is possible—that is, at times 
it is possible to reduce the risks involved in a study by modifying its 
design. & 

Example 12.15 Reconsider the illustration used throughout the 
last section. Six samples of 5 employees were drawn and assigned to 
the efficiency analysts. The resulting information was used as a basis 
for deciding whether or not the analysts assigned ratings which on 
the average, were equal. J 

An alternative, undoubtedly more efficient, experimental design 
would require that only one set of 5 employees be drawn. Each analyst 
then would be required to rate each of the 5 employees 

The control of employee variation would reduce the possible varia¬ 
tion between sample means and, hence, would reduce the risks of ob¬ 
serving significantly different average ratings in an experiment when 
actually the populations were similar. Ordinarily, the only disad¬ 
vantage of this alternative design is that it requires a slightly more 
complicated analysis of variance. 

. As originally formulated, the design of the study of analysts’ rat¬ 
ings required the analysis of variation arising only from one factor 
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_analysts. The problem was, in fact, to decide whether or not ana¬ 
lysts did vary. As modified in Example 12.15, the problem required 
the consideration of variation arising from two factors—employees 
as well as analysts. In the first case we have a one-factor experiment, 
while in the second case we have a two-factor experiment. It also is 
possible to have three-factor or four-factor or m-factor experiments, 
though the more factors one considers, the more complex the anal- 


There are two reasons for controlling more than one factor in an 
experiment. First, as already noted, the experimental design may be 
more efficient in that the risks of an incorrect decision are reduced. 
Secondly, it is possible that the purpose of the study is to provide in¬ 
formation not only about one factor but about two or more. 


Frnmnle 1216 The research department of a manufacturing 
compan/is developing a new adhesive tape. The uses for which the 
tape is designed dictate that its elasticity is an important quality 
characteristic. Preliminary tests on the tape as it has thus far been 
developed indicate that it is too variable. Thus, an experiment is to 
be conducted to decide which factors contribute to this variability 
and, hence, which factors must be further investigated, modified, or 

otherwise controlled. , . 

Among the factors to be examined as sources of potential variation 

in the experiment are the following 5 factors. Thus, the experiment 
would be called a 5-factor experiment. 


1. Machines used to produce the tape. 

2. Speeds at which the machines are operating. 

3. Grades of the raw material used. 

4. Suppliers of the raw material. ... 

5. Temperatures at which the tape’s elasticity is tested. 


Not only should such an experiment provide information about 
whether or not each of these factors influences the tape s elasticity 
but it also should provide information about how each combination 
of these factors influences it. For example, it may well be that while 
neither machine nor speed in themselves influence quality, certain 
speeds on certain machines do interact to cause differences m the 
quality of the tape. Thus, good statistical design must provide in¬ 
formation about interactions as well as direct effects. 


In connection with the last illustration, it should be noted that 
even today many experiments both in industry and out are based 
on the erroneous idea that in any experiment all but one factor of 
interest should be kept as constant as possible. This is a one-at-a- 
time experimental method. It usually is a naive and inefficient pro¬ 
cedure, however, because it requires several experiments to produce 
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even some of the information that one good statistically designed 
experiment could provide. 

In concluding this discussion let us consider one other illustration 
to indicate different types of experimental designs and the impor¬ 
tance of gearing the experimental design to the problem at hand. 

Example 12.17 3 A supermarket chain feels that the proper pack¬ 
aging of potatoes might increase sales. Therefore, it decides to experi¬ 
ment with 4 types of packaging. These represent the one factor of 
interest in the experiment. However, other factors that might be con¬ 
trolled in order to provide a more efficient experiment include the 
location of the store at which potatoes are sold and the day of the 
week. 

Thus, the supermarket has 4 stores (A, B, C, and D) at different 
locations. Management also decides to limit the first packaging ex- 
peiiment to Monday through Thursday. Below we shall consider and 
compare 3 experimental designs which differ in whether or not loca¬ 
tion and day of sales are controlled. 

As a basis for this comparison we shall deal with a simple situa¬ 
tion and some simple numerical observations. Thus, suppose that 
packaging has no effect on sales; that is, that the means of the 4 pop¬ 
ulations of sales with different packaging are equal. However, we 
shall assume that sales for different days of the week and in different 
stores do vary and, on the average, they are as follows irrespective of 
packaging: 


EXPECTED SALES OF POTATOES 
(Hundreds of Pounds) 


Day of the Week 

Store 

A 

B 

C 

D 

Monday. 

16 

17 

27 

18 

Tuesday. 

19 

29 

22 

17 

Wednesday. 

31 

26 

28 

28 

Thursday. 

23 

15 

27 

17 


First consider the use of a completely randomized design in the 
packaging problem. For this design we would decide on the type of 
packaging to use in a given store at a given time in a completely ran¬ 
dom manner. Suppose that each form of packaging is to be used 
4 times. The form of packaging may be assigned in a random man¬ 
ner by assigning numbers 1-4 to method I; 5-8 to method II; 9-12 
to method III; and 13-16 to method IV. Suppose then, that a table 

•d Brim k an( f Walter T. Federer, “Experimental Designs and 

Probability Sampling m Marketing Research/’ Journal of the American Statistical 
Association, Vol. XLVII (1953), pp. 440-52. 
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of random numbers is consulted and the methods of packaging are 
assigned to the stores for the days of the week as follows: 


COMPLETELY RANDOMIZED DESIGN 


Day of Week 

Store 

A 

B 1 

C 

D 

Monday. 

III 

I 

II 

IV 

Tuesday. 

II 

II 

I 

III 

Wednesday. 

II 

I 

IV 

III 

Thursday. 

I 

III 

IV 

IV 


If the observations indicated earlier are made, we would find that 
the sales for each method of packaging are as follows: 

SALES 

(In Hundreds of Pounds) 


Method: 

I 

II 

III 

IV 


23 

14 

16 

28 


17 

31 

15 

27 


26 

29 

17 

18 


22 

27 

28 

17 

Average: 

22.0 

26.5 

19.0 

22.5 


The range in the means of 7.5 hundred pounds (from 19.0 to 26.5 
hundred pounds) is a rough indicator of the difference in effect of 
the various methods of packaging. The randomizing process has en¬ 
sured that despite the effect of different days of the week and store, 
the variability for a given method of packaging is due to chance. We 
could, therefore, apply the analysis of variance technique discussed 
in Section 12.5 to discover if the average effects of the methods of 
packaging differ. 

Even though the effect of day of the week and store on sales is 
randomized, they still obviously contribute to the difference in sales 
observed for each method of packaging. On the basis of chance 
alone, packaging method II, for example, was never used on Thurs¬ 
day nor was packaging method IV tried in store A. 

Next, consider a two-factor or randomized block design, as it is 
called. We can eliminate the effect of day of the week on the differ¬ 
ence in average sales by ensuring that each method of packaging is 
used during every day of the week. In this case, the days of the 
weeks are called blocks and the method of packaging to be used in 
each store is determined in a random manner. Numbers 1-4 can be 
assigned to methods I-IV, respectively. Numbers 1-4 are drawn at 
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random for each day. The first number determined the packaging 
method for store A the second for store B, etc. Suppose that this is 
done and the results are as follows: 

RANDOMIZED BLOCK DESIGN 


Day of Week 


Store 

—- 

A 

B 

C 

D 

Monday. 

II 

I 

IV 

III 

III 

I 

III 

Tuesday. . . 

I 

IV 

II 

Wednesday. 

IV 

III 

II 

Thursday. . .. 

I 

IV 

II 


^ vctLiuns maicaiea earlier are made, we 
sales for each method of packaging are as follows: 


Method: 


I 

19 

23 

17 

28 


II 

16 

22 

28 

27 


III 

26 

18 

17 

II 

19.5 


IV 

31 

29 

15 

27 


Average: 21.75 24.25 19.5 25.5 

^ ain the range of the means of 6 hundred pounds (from 19.5 to 
25.5 hundred pounds) is a rough indicator of the effect of varying 
the method of packaging. The difference is only 6 hundred pounds 
since we have eliminated the effect of different days of the week 
We could have used the randomized block design to eliminate the 
effect of different stores. The stores would have been considered as 
blocks, and each method of packaging, selected in a random manner 
used on one day during the Monday through Thursday period. ’ 
Lastly, consider the Latin Square design for which it is possible 
to eliminate the effect of different days of the week and different 
stores at the same time. With this design each method of packaging 
is tried m each store and during each day. One systematic Latin 
Square arrangement appears below. You will notice that one 
method of packaging always occurs along a diagonal: 

LATIN SQUARE DESIGN 


Day of Week 

Store 

A 

B 

C 

D 

Monday. 

I 

II 

III 

IV 

IV 

I 

II 

III 

III 

IV 

I 

II 

II 

III 

IV 

I 

Tuesday. 

Wednesday. 

Thursday_ 















396 


' STATISTICS FOR BUSINESS DECISIONS [Oh. 12 

If we now make the pertinent observations given earlier, we find 
the following: 


SALES 



On 

Hundreds of Pounds) 

Method: 

I 

II 

III 


16 

19 

31 


29 

26 

15 


28 

27 

27 


17 

18 

17 

Average: 

22.5 

22.5 

22.5 


You will now observe that the average sales for each method of 
packaging are identical. Thus, when the effect of the difference m 
stores and difference in day of the week has been removed, no dif¬ 
ference in average sales for each method of packaging is revealed. 
In other words, there appears to be no relationship between the 
method of packaging potatoes and the sales of potatoes. . 

Hence, the importance of the proper experimental design is ob¬ 
vious. If the completely randomized or randomized block design had 
been used, we might have concluded incorrectly that the method of 
packaging has an impact upon sales of potatoes. 

Of course, the above data have been arranged to get the indicated 
(and extreme) results—to show how the different designs could re¬ 
sult in different conclusions. Hence, certain important aspects of ex¬ 
perimental design follow from these illustrations. The completely 
randomized design is adequate only where other factors are known 
to exercise a very minor impact on the characteristic of interest, lhe 
randomized block design is adequate when only one other factor 
exerts a sizable influence on the characteristic of interest A Latin 
Square design may be used when two other factors are known to 
have an impact. 

12.7 You Should Now Know That 

A comparative experiment requires experimentation as a basis 
for comparing properties of two or more populations. 

The statistical design of an experiment is a plan for collecting ob¬ 
servations. , , . 

In planning an experiment it is always necessary to relate deci¬ 
sions to possible outcomes. . 

4 common type of decision problem calls for the comparison o 
two populations because the consequences of taking an action de¬ 
pend upon the difference in the means of the populations. 
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A statistical decision rule for such a problem should specify the 
sample size, the sample allocation, the method of selecting the sam¬ 
ple, and the critical value. 

Some comparative experiments are necessary in order to provide 
partial information on the difference between two population pro¬ 
portions. 

Usually matched or paired samples will yield more efficient re ¬ 
sults than a completely randomized experimental design. 

Many statistical decision problems may require experimentation 
to compare several populations. 

A useful set of statistical techniques applicable in many of these 
situations is called the analysis of variance. 

One experimental design for comparing several populations is 
the completely randomized design. 

Other designs which involve controlling one other and two other 
factors, respectively, are the randomized block design and the Latin 
Square design. 

12.8 You Should Now Be Able to Solve 

1. Critically discuss the following statement: The proper de¬ 
sign of an experiment requires that all factors, except the one being 
examined, be held constant. 

2. In each of the following problem situations, suggest an ex¬ 
perimental design. Explain carefully and discuss the reasons for 
your choice. 

a) One of two machines must be adopted for a certain manufacturing 
operation. The one which results in a lower proportion of defective 
product should be chosen. In addition to the possibility that ma¬ 
chines turn out different product, it is thought that the grade of 
raw material used affects the quality of the product. Assume that 
there are 5 grades of raw material used in the operation. 

b) In designing a procedure for testing the melting point of a newly 
developed alloy, a researcher wishes to know whether or not any 
of the following three factors result, on the average, in different 
readings: thermometers used, analysts making the readings, and 
equipment employed in melting the metal. Assume that 5 ther¬ 
mometers, 3 analysts, and 2 sets of equipment are available for 
use. 

c) A taste-testing experiment is to be conducted to decide between 
two methods of preparing a frozen vegetable. Several judges will 
be used to rate the taste of the vegetable. 

3. The maintenance department of a concern must choose be¬ 
tween 2 batteries for use in the firm’s operations. A sample of both 
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types is to be tested to failure to provide a basis for decision. A total 
sample of 50 batteries is to be used in the experiment. If the differ¬ 
ence in average time-to-failure between battery A and battery B is 
30 minutes, then a risk of using Type B, equal to only .01, is desira¬ 
ble. Based on the objectives set, formulate a statistical decision rule. 

4. The experiment sketched in Question 3 is conducted. The fol¬ 
lowing times-to-failure (in minutes) are observed. Apply the rule 
formulated above. 


A 


122 

158 

115 

119 

157 

185 

116 

148 

138 

178 

161 

153 

103 

136 

140 

193 

118 

102 

178 

188 

195 

123 

109 

135 


112 



B 


127 

86 

92 

154 

82 

92 

105 

161 

98 

101 

83 

125 

127 

142 

163 

149 

83 

178 

89 

90 

119 

160 

155 

166 


95 



5. An advertising agency plans an experiment to compare the 
effectiveness of an advertisement in a local newspaper with the 
effectiveness of a direct-mail campaign in small towns. In 5 towns, 
an ad is placed in the local paper. In another 5 towns, a direct-mail 
campaign is conducted. Two simple random samples of 2,500 resi¬ 
dents each are selected. Residents are asked whether they remem¬ 
ber the advertisement. Let 7r A and v D be the proportions of persons 
who remember the ad in each group of towns. It is decided that if 

= 7 r D} then a risk of only .02 of using the direct-mail campaign on 
a nation-wide basis should be taken. 

Formulate a statistical decision rule for this problem based on 
the objectives given above. 

6. The experiment discussed in Question 5 is conducted. It is 
found that Pa is .2125, while p D = .2800. Apply the rule formulated 
above. 

7. The personnel department of a large insurance company 
wishes to test the effectiveness of a suggested program of special 
stenographic instruction. The program would be given to all secre¬ 
tarial personnel in the company. A preliminary experiment is to be 
designed to provide a basis for deciding whether or not to give the 
program. A sample of 25 employees (one class) is to be selected, and 
for each employee the number of words per minute taken in dicta- 
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tion is to be observed. After they complete the program, another 
stenographic test is to be given, and again speed is to be observed. 

Let be the population average for employees before the pro¬ 
gram and pa the average after the program. Furthermore, let D = 
pa - p B . The personnel administrator of the company’ specifies 
that if the training results in no improvement on the average (i.e., 
if Z) 0), only a .01 risk of adopting the program should be as¬ 
sumed. Based on these objectives, formulate a statistical decision 
rule. 

8. The experiment indicated in Question 7 is conducted. The 
following observations (words per minute) are observed. Apply the 
rule formulated above. ' 


Employee 


Before Instruction 


After Instruction 


1 

2 

3 

4 

5 
6 . 

7. 

8 . 
9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 

16. 

17. 

18. 

19. 

20 . 
21 . 
22 . 
23. 

24., 

25.. 


88 

108 

106 

112 

107 

108 
111 
126 
129 
110 
103 

92 

115 

107 

102 

99 

105 

107 
121 
128 
115 
102 

108 
113 
135 


92 

107 

109 

113 

118 

103 

109 
129 

129 
113 
105 

95 

122 

103 

115 
102 

116 
121 

110 

130 
115 
103 
107 
118 
135 


12.9 You Will Also Find That 

For other brief introductory discussions of experimental design 


£ ches °V- Quality Contro1 and Industrial Statistics, pp. 
735-64. Rev. ed. Homewood, Ill.: Richard D. Irwin, Inc., 1959. 

Tippett, L. H. C Technological Applications of Statistics, pp. 159-84. 
New York: John Wiley & Sons, Inc., 1950. 
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Wallis, W. Allen, and Roberts, Harry Y. Statistics: A Eew Ap¬ 
proach, pp. 479-83. Glencoe, Ill.: The Free Press, 1956. 

More extended discussions will be found in the following key ref¬ 
erences * 

Cochran, William G., and Cox, Gertrude M. Experimental Designs. 

New York: John Wiley & Sons, Inc., 1950. 

Fisher, R. A. The Design of Experiments . 5th ed. Edinburgh, Lon¬ 
don: Oliver & Boyd, 1949. 



CHAPTER • 13 


Decisions by Association 


13.1 Introduction 

In this chapter we are going to cover some statistical theory and 
some statistical applications which are essentially different from 
those in the preceeding chapters. As the title of this chapter tells you, 
we are going to study decisions by association. What does this mean? 
In what ways is this topic different from others we have studied? 
These questions can be answered in an elementary way with refer¬ 
ence to a familiar situation. 

Example 13.1 The main character in this example is a typical 
American housewife, Mrs. Smith. The scene of the example is the 
neighborhood store. Mrs. Smith is considering whether or not to buy 
some grapes. She has a decision problem: to buy or not to buy 

The grocer is a good neighbor. Mrs. Smith is a good customer' The 
grocer, in fact, always allows Mrs. Smith to sample a few (but not 
too many) of the grapes as a basis for her decision. In our termi- 
nology, Mrs. Smith is allowed to make her decision on the basis of 
observations from the population of grapes on which she is making a 
decision. 

In a broad way, the above example describes the type of statistical 
decision problem that we have been discussing in many different 
business connections. Almost all of our problems, including the 
grape-buying decision, have dealt with a specific set of elementary 
units. In addition, the pertinent characteristics of these elements 
have made up a relatively accessible population. Certainly, in all 
our examples, samples could be drawn from these populations and 
observations made on the characteristics of interest. 

(Let s go back to Mrs, Smith at the grocery. 

Example 13.2 Mrs. Smith, having decided on the grapes (she 
bought a pound), eyes the cantaloupes. Unfortunately, she cannon 
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taste the cantaloupe to test its flavor. (Even as good a customer as 
Mrs. Smith is not permitted such liberties.) She, therefore, smells the 
cantaloupe, balances it, and squeezes it. On the basis of observations 
of smell, balance, and hardness, she reaches her decision as to the 
taste of the melon. 

The decision-making experiment in this case is quite different 
from the procedure followed in deciding on the purchase of grapes. 
To reach a decision on the lot of grapes, grapes are sampled because 
the elementary units are accessible and the pertinent characteristic 
(taste) of the grapes sampled can be observed easily. In the case of 
the melon, however, a slice of the melon is not accessible and the 
characteristic of the melon cannot be observed. Mrs. Smith, there¬ 
fore, has to make a decision by association. On the basis of past ex¬ 
perience she is confident that if a melon smells sweet, is balanced 
perfectly, and is neither too hard nor too soft, its taste is pleasant. 
Thus, Mrs. Smith relies on the results of her observations of char¬ 
acteristics that she has found are associated with tasty melons. 

To make decisions by association, we try to predict one variable 
(taste) on the basis of other variables (smell, balance, hardness). In 
general, the variable which we try to predict is called the dependent 
variable . The variables which are the basis for the prediction are 
called independent variables. In this chapter we shall limit our dis¬ 
cussion primarily to situations which involve the relationship be¬ 
tween a dependent variable and one independent variable. This is 
called a simple relationship. In such cases, a statistical study deals 
with a bivariate population; that is, a population which includes a 
pair of observations for each elementary unit. If we were to consider 
a relationship between a dependent variable and two or more in¬ 
dependent variables, we should have to consider a multiple relation¬ 
ship and, hence, a multivariate population. 

Decisions by association are not uncommon in the world of busi¬ 
ness. Let us consider a few illustrations. 

Example 13.3 A personnel department has to decide on the ac¬ 
ceptability of candidates for a position. It would be too time con¬ 
suming and expensive to give each applicant an on-the-job trial. 
On the basis of prior experience, the personnel department has ob¬ 
served that job performance is related to a candidate’s score on a 
series of tests. It, therefore, attempts to predict performance on the 
basis of associated variables—test scores. 

Example 13.4 A firm manufacturing airplanes must weld joints 
on the wing of the plane. The wing must be able to withstand shear- 
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ing forces while in flight. The test to determine shearing strength of 
a joint, however, is destructive in nature. Therefore, an associated 
variable—the weld diameter—is observed to predict shearing strength. 

Example 13,5 An appliance manufacturer desires to predict his 
company’s sales for the coming year. It has been found that dis¬ 
posable personal income is associated with the sales of the firm. The 
firm, therefore, predicts its sales on the basis of estimates of dispos¬ 
able personal income made by economists. 

Example 13.6 A cost accountant wishes to predict the cost of 
material handling for the coming year. He uses an associated vari¬ 
able—direct labor cost—to make his decision on the estimate. 

In each of the above illustrations, because of considerations of cost, 
time, or availability of data, one variable is estimated on the basis 
of its relationship to an associated variable. 

13.2 Two Aspects of Association 

There are two aspects to problems involving decisions by associ¬ 
ation. One aspect concerns itself with the degree of relationship. 
This refers to the extent to which decisions are enhanced by taking 
into consideration an associated variable. The second aspect concerns 
itself with the nature of the relationship. This refers to the functional 
relationship between the variables which enables us to predict the 
value of a dependent variable from the value of an independent 
variable. 

Example 13.7 In the field of education the degree of association 
between arithmetic and reading is of interest in order to reach de¬ 
cisions on methods of teaching. If it is found that there is a high de¬ 
gree of association between arithmetic and reading ability, a pupil’s 
reading ability should be considered in diagnosing retardation in 
arithmetic. The interest here is not to predict a pupil’s arithmetic 
ability on the basis of his reading ability but to reach a decision on 
whether arithmetic ability might be enhanced by reading ability. 
Therefore, the degree of relationship is relevant. 

In contrast, all the business illustrations in Examples 13.3 through 
13.6 are illustrations of problems where the nature of the functional 
relationship is paramount. Thus, to reach a decision on sales for the 
following year, on the basis of disposable income, it is necessary to 
know the nature of the relationship between the variables. 

Statisticians refer to problems concerning the degree of relation¬ 
ship as correlation problems and to those involving the nature of re- 
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lationship as regression problems. We can gain further insight into 
the nature of correlation and regression problems by considering the 
diagrams in Figure 13.1. 

Figure 13.1(a) illustrates the relationship between wages earned 


Figure 13.1 

SOME RELATIONSHIPS BETWEEN PAIRS OF ASSOCIATED VARIABLES 



HOURS OF OVERTIME PER DAY 


(b) 



NUMBER OF UNITS PRODUCED 



(INCHES) 


and hours worked when the wage rate is $2.00 per hour. The nature 
of the relationship is described by a straight line whose equation is 
Y = 2X, where Y represents wages earned and X hours worked. The 
degree of relationship or correlation in this case is perfect since once 



Oh. 13] DECISIONS BY ASSOCIATION 405 

the number of hours worked is known, the amount a worker has 
earned is determined exactly. 

Figure 13.1(6) represents the relationship between the amount to 
which an investment of $1,000 (compounded annually at 5 per cent) 
will accumulate and the number of years that the money is invested. 
Here, again, the degree of relationship is perfect. However, in this 
case, the nature of relationship is not described by a straight line but 
by a curve. The curve is an exponential curve and is expressed by the 
equation: Y = 1,000 (1.05) x 

Figure 13.1(c) illustrates the nature of the relationship between 
overtime pay and the number of hours overtime worked a day for an 
agency that allows only a flat $2.00 for overtime. Obviously, there 
is no relationship between hours of overtime worked and pay since 
no matter how many hours of overtime an employee works, the pay 
remains the same. Graphically, the absence of relationship is rep¬ 
resented by a straight line parallel to the X-axis. 

Figure 13.1(d) represents the relationship between total cost and 
number of items produced by a firm. The degree of relationship in 
this case lies somewhere between the two extremes of perfect rela¬ 
tionship and no relationship. Decisions can be enhanced by the use 
of the associated variable in such instances, but the value of one 
variable cannot be precisely determined by the value of the asso¬ 
ciated variable. A straight line seems to offer a description of the 
nature of the general relationship. This illustration represents a 
situation that is typical of association between variables encountered 
m business problems. 

Figure 13.1(e) also represents a relationship that is not perfect 
but in this case the nature of the relationship is better represented 
by a curve rather than a straight line. This relationship tells us that 
yie d per acre increases with the amount of rainfall and then starts 
to decrease. This is logical for if rainfall were to increase indefinitely 
crops would be flooded and there would be no yield. 

As we have seen, regression relationships can be represented by 
straight lines or by curves. Relationships described by straight lines 
are referred to as linear; those represented by curves as curvilinear . 

However, m this text we shall limit our discussion to linear relation¬ 
ships. 

The diagrams that were drawn in the previous illustration are 
called scatter diagrams. Scatter diagrams are very useful devices for 
investigating the nature of the relationship between variables. By 
convention, the independent variable is considered the X-variable, 
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and it is plotted on the horizontal scale; that is, on the X-axis. The 
dependent variable is considered the F-variable, and it is plotted on 
the vertical scale or F-axis. 

In any problem involving a decision by association, a scatter dia¬ 
gram will provide a clue to the nature of and the degree of the rela¬ 
tionship. Many erroneous and needless computations can be avoided 
by this simple device. Thus, the use of a straight line relationship 
when a curve would offer the proper description would lead to unre¬ 
liable results, and perhaps to an incorrect decision. 

13.3 Association and Causation 

A word of caution is needed in the interpretation of the meaning 
of association. The mere fact that two variables are associated does 
not imply that one variable is a cause of the second variable. Thus, 
the fact that the smell and the taste of most melons are related does 
not mean a sweet smell causes a sweet taste. Neither is the fact that 
an individual who scores high on a test also performs well on the job 
mean that a high test score is a cause of good performance. 

The presence or absence of association can be demonstrated 
through the use of statistics. However, the determination of whether 
a cause-and-efiect relationship exists depends on knowledge of the 
substantive field to which the data apply. Thus, it can be shown 
that the incidence of heart disease is associated with the level of 
consumer prices in the United States. However, this does not prove 
that an increasing price level is a cause of a higher incidence of heart 
disease. To prove cause and effect, a researcher would have to con¬ 
nect, on the basis of subject matter, high prices and the physiology 
of the heart. Although the statistical association is real,, its interpre¬ 
tation as evidence of a cause-and-effect relationship is erroneous. 

Why is it that .associations can be shown even though ea.use-and- 
effect relationships are not present? One reason for statistical asso¬ 
ciation between variables that are not related logically is that they 
are both related to other factors. Thus, the relationship between con¬ 
sumer prices and the incidence of heart disease occurs because both 
of these variables have been increasing over a period of time. The 
passage of time is the third factor implicitly responsible for the sta¬ 
tistical relationship. 

Example 13,8 During the past 25 years there has been a close re¬ 
lationship between teachers’ salaries and liquor consumption in the 
United States. This relationship can be explained by the tact that 
both variables are related to the level of income in the country. In- 
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creases or decreases) in teachers’ salaries are associated with in 
reases (or decreases) m national income. Similarly increases (or 
decreases m liquor consumption are associated with increases £ 
decreases m national income The relation*!^ w ncleases (° r 

hoS ofTud^Twhy tte numbevtl Sort’ ’T 

lage was related to the size of its population? * E S h Vl1 ' 

. , f second ^ason for the observation of a relationship between var 

stze sample Thuf'r CaUSe ^ n + d 'f ect relationship exists is a small- 
. _ pie. Thus, it is possible to draw a sample which exhibits some 
association even though there is no relationship between the vari¬ 
ables m the population. In a later section of this chapter we will de¬ 
scribe a test to see whether the association we observe could be 
merely a chance relationship. 

in A * kTr 1 rea T, for an observed relationship may be due to the fact 
that both variables contain a common element. This is especially true 
if ratios with common elements are associated with one another If 
two ratios have the same denominator, then a small denominator 
tends to make both ratios large and a large denominator tends to 
make both ratios small. Conversely, if two ratios have the same nu¬ 
merator, a small numerator will tend to make both ratios small and 
a large numerator will tend to make both ratios large. It is not sur¬ 
prising therefore, to find that some correlation exists between the 

‘‘ n,i the net profite/,,e ‘ “ ies 

m ious corrdation" and “nonsense correla- 

ion,_ often are used to describe situations where a high degree of 
association exists without any cause-and-effect relationship. How¬ 
ever, it is not the degree of association that is spurious—it is the ex¬ 
planation of the high degree of relationship. Therefore, great care 
must be taken m explaining the meaning of a close statistical asso- 
ciation between variables. 

If a cause-and-effect relationship is to exist, however, there must 
be association between the variables. In recent years, there has been 
much debate as to whether cigarette smoking is a cause of lung can¬ 
cer. To demonstrate the possibility of a cause-and-effect relation- 

sodaL W H e Tt? la t0 demonstrate that the two variables were as¬ 
sociated. Had this association not been demonstrated, it would not 

ave been possible to support a claim that smoking is a cause of lung 

cancer. However, the existence of the statistical association does not 
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prove that a cause-and-effect relationship exists. The interpretation 
of the observed association is the basis of the present conflict. 

We can sum up the discussion in this section with the statemen . 
While causation implies association, association does not imply can- 

sation. 

13.4 A Population Model for Regression Problems 

We now direct our attention to one of the aspects of decisions by 

JS.tion: regression, the nature of 

this discussion, let us reconsider some subject matter from an earli 
chapter In order to estimate the relative frequency of observations 
Sto given limits in . population we ordinarily must assume some 
population model (such a. the normal curve). For "P e ' 
that we are studying the specifications for the diameters of ball bear 
** produeed by a given process. Further suppose that we estimate 
that the diameters average .15 inches and that their standard devia- 
tion is 002 inches. If we then assume that the normal curve is an 
adequate population model, we may state: “About 95 per cent of 
the ball bearings produced will have idiameters ’between .W6 and 
.154 inches.” The basis for this statistical prediction is the assump 

tion of a normal population model. , 

In a regression problem we are interested in predicting one vari¬ 
able on the basis of its relationship to a second variable. In order to 
make statistical predictions we again must make certain assumptions 
about the population. These assumptions constitute a bivariate pop- 

Ula, Yh.Q particular model with which we shall be concerned in this 
section is called the simple linear-regression model. There are many 
“her modi which on be need to describe the .ssoo.atron between 
two variables. The simple linear-regression model, bo ”™b “ 
which frequently applies to the association between variables and is 
one for which the techniques for making predictions have been de- 

wTcan introduced characteristics of this model through an lllus- 
tration. 

Example 13.9 A manufacturer of detergents is interested m pre- 
dieting the behavior of a particular compound under varying tem- 
neratures This will help him to decide on whether the new compound 
FsTood enough to be produced. The relationship between detergency 
fmd°temperature is aF important factor. 

oln+hine water at low temperatures is used. The detergent mus , 
therefore, have sufficient detergency at low temperatures to have a 
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cleansing effect. Coarser items of apparel are washed at higher tem¬ 
peratures. The detergency at these high temperatures should not be 
so excessive as to impede the proper operation of the washing ma¬ 
chine, or to interfere with the washing and rinsing functions. 

The manufacturer desires to determine on the basis of a sample 
study the nature of the relationship between the amount of deter¬ 
gency and temperature so that he can predict the behavior of his 
product under varying temperatures. 

The population in this problem is bivariate. For each elementary 
unit (a fixed amount of detergent) we must observe two character¬ 
istics-—its detergency and the temperature to which it is subjected. 
Let us consider the assumptions which this bivariate population 
must satisfy for the simple linear-regression model to apply. 

The first assumption is that the relationship in the population is 
linear. That is, a straight line describes the relationship between de¬ 
tergency and temperature. In Figure 13.2 the straight line describes 
the relationship between the two variables. We call this line the 
population regression line of Y on X. Detergency is the F-variable 
since it is the variable we desire to predict. Temperature is the 
X-variable since it is the associated variable to be used as a basis 
for prediction. Hence, the line represents the regression of deter¬ 
gency on temperature. 

A second assumption is that for a given value of X, the mean of 


Figure 13.2 

POPULATION RELATIONSHIP BETWEEN DETERGENCY AND TEMPERATURE 
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the F-values falls on the line of regression. If we refer to the chart, 
we note that for a temperature of 85 degrees the average index of de¬ 
tergency is 50. Similarly, the average index of detergency at a tem¬ 
perature of 115 degrees is 100. 

A third assumption is that the variance (and standard deviation) 
of the F-values about the line is the same for each X-value. This 
means that for any given temperature the scatter of the index of 
detergency about the line should be the same. 

A fourth assumption is that the F-values for a given X-value are 
normally distributed. For example, consider the distribution of in¬ 
dexes of detergency at a temperature of 85 degrees. To apply the 
model under consideration we must assume that the indexes of de¬ 
tergency at this temperature are normally distributed. The same 
assumption also must be made at all other temperatures considered. 
Graphically, we may depict the assumption for our illustration by 
Figure 13.3. 


Figure 73.3 

POPULATION RELATIONSHIP BETWEEN DETERGENCY AND TEMPERATURE 



The assumptions of a simple linear-regression model are sum¬ 
marized below: 

1. The population relationship is linear. 

2. The mean of the F-values for a given X-value lies on the regression 
line. 
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4. The distribution of F-values for any X-value is normal. 

Let us now develop some of the properties of the simple linear- 
egression model more fully and formulate them in symbolic terms 
First of all, we know that m order for the model to be applicable the 
;vaues associated with a given X-value must be normally dis- 
n u e . Thus each set of F-values comprises a normal curve with 

by r symbol We ^ ^ ™ 


which is read “the mean of the F-values for a given X.” We alreadv 

FTaMeTSex f a 5 f ° f 85 d ^ees, the average 

(mdex of de tergeney) is 50. This indicates that 


MF- 


50 


Similarly, the fact that at a temperature of 115 degrees the average 
k -value is 100 may be indicated by g6 

MF115 = 100 . 

In order to meet the conditions of the simple linear-regression 
model, all of the values of must lie on the regression line This 
hne, which relates ^ and X, may be expressed,^ ge^as 

»r-x = A + BX . 

the equatioif is enCy temPeratUre " Sh * 11 «».t 


nv-x = -91.67 + 1.67X. 

You may recognize the general form of this relationship as the 

ST equatl0n f ° r a str aight line. A and B are constants which 
determine a particular line; often A is called the F-intercept and 
is called the slope of the line. Thus, A is the value of ^ x when X 

In our problem 0 ^’ t n0t haVe “ y practical si gnificance. 
our problem it is not necessary to consider a value of zero for X 

temperature) . The value of A (-91.67), therefore, has no meanTnf 

haS ° n ^ mathematical significance-together 
with B it locates the regression line. 

in ?T Sl ° Pe ’ ?’ referS t0 the Change in >*■* P er unit of increase 
X. In our reference example, B represents the change in the aver- 
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age index of detergency as temperature increases 1 degree. Since 
B = 1.67, this means that i^. x increases 1.67 units per degree m- 

Cre When there is no relationship between the independent and de¬ 
pendent variable, the value of B is zero. In other words, there is no 

change in My x unit of increase in X. 

The two constants in the regression equation have been symbol¬ 
ized by the capital letters, A and B, because they are population 
parameters. (The Greek letters, a and ft, have not been employed 
because they already have been used in this textbook m another 
connection.) In the next section we shall consider the Problcm o 
estimating A and B on the basis of samihe observa ionsjOT t 
present, however, note that once we have determined A and , 
can find, from the regression line, the average value of the deter 
gency index for any given temperature. 

Example 13.10 We have assumed that for the bivariate popula¬ 
tion of detergency and temperature 

Mr . x = -91.67 + 1.67X . 

Hence to determine the average index of detergency if^the^etergent 
were exposed many times to temperatures of 90 degrees, w 
X = 90 and find 

„ y . 90 = -91.67 + 1.67X 
= -91.67 + 1.67(90) 

= -91.67 + 150.3 
= 58.6 

The regression line that we have considered in this section de¬ 
scribes the relationship for the bivariate population where tem¬ 
peratures range from 80 to 120 degrees. We have assumed nothing 
about the relationship that exists outside of this range. Hence, w 
should not use the regression line to determine the average index of 
detergency for temperatures, for example, of 40 or 140 degrees. Of 
course we could substitute X-values of 40 and 140 degrees m the 
regression equation and determine the corresponding average in¬ 
dex of detergency. However, this use of the regression line would be 
on the basis of our judgment that the same relationship would exist 

at these temperatures. „ ,, v for 

Let us now consider the standard deviation of the T-values tor 
a given X-value. That is, let us consider the scatter about their 
mean („,*) of possible indexes of detergency for a given X-vahie 
Since the distribution of F-values for a given X-value is normal, if 
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we know their standard deviation the extent of scatter can be de¬ 
termined. The formula for the standard deviation of the F-values 
ior a given X-value is 


O' Y-X — 


^F-X ) 2 


where F is an observed F-value for a given X. The standard devia¬ 
tion of the F-values for a given X is called the standard deviation 
of regression or the standard error of estimate. The latter term may 
be confusing because is a property of a population, and the 
erm standard error” usually is associated with errors in sampling 

procedures. Hence, we shall use the term “standard deviation of re- 
gression. 

Example 13.11 Let us assume that the standard deviation of re¬ 
gression for the index of detergency is 6 units. Then we know that 
approximately 68 per cent of the time, if a detergent is used with 
water at 115 degrees, the index of detergency will fall between 

M1M15 - <Ty .ns and /1K.J15 + 0-J..JJ5 ; 

100 - 6 and 100 + 6 , 

94 and 106. 

Similarly, about 95 times in 100, the index will fall between 

Mf-115 ~ 2 0-F.H5 and fi Y . m + 2cr F . 115 , 

100-12 and 100 + 12, 

88 and 112. 

Outside of what limits will the index fall about 27 times in 10,000? 

13.5 The Estimated Regression Line 

In almost every practical problem requiring decisions by associa¬ 
tion, the population line of regression and the population standard 
deviation of regression are unknown. They must be estimated on the 
basis of the results of a statistical study. 

The standard sampling procedure in a regression problem for 
which the simple linear-regression model is applicable is to observe 
random samples of F-values for fixed values of the X-variable. Thus 
m the detergency problem the detergent might be tested several 
times at each of several fixed temperatures. For example, four tests 
might be performed at a temperature of 85 degrees, four other tests 
made at a temperature of 100 degrees, and four other tests made at a 
temperature of 120 degrees. The outcomes (observed indexes of de¬ 
tergency) would be assumed to constitute three random samples of 
four observations. 
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You will observe that in this experimental procedure we assume 
that the X-variable is completely controlled and that the X-values 
do not vary due to chance. They are fixed, or as is sometimes said, 
the values of X are error-free. For each fixed value of X, however, 
a random sample of F-values is observed. 

How do we know that the 7-values for a given X-value constitute 
a random sample? How do we know that these 7-values are inde¬ 
pendent of each other—that the magnitude of one does not influ¬ 
ence the magnitude of a second? The answer to these questions is 
similar to the answer to a similar question considered m our study o 
statistical quality control. In that field of application it was essentia 
that a process behave like a random process m order for the control 
chart to be used. However, this could be assumed m most applica¬ 
tions because experience had indicated that most production proc¬ 
esses behaved as random processes. Similarly, here it seems reason¬ 
able to assume that outcomes of the detergency test should not in¬ 
fluence one another—that they should constitute random samples 

from the population. . « , 

Suppose that samples of 7-values are observed for various fixed 
values of the X-variable. A and B would then be estimated on the 
basis of the sample data. The estimate of A is denoted as a, the esti- 
mate of B as b. Hence, the equation of the line as estimated from the 

sample is 

(ly-x = a + bX . 

On other occasions, we have indicated that statisticians very often 
use unbiased estimates of parameters. We select, for a and b ,^values 
which are unbiased estimates of A and B, respectively. The for- 
mula for b is: 


b = 


riZxy — 2x%y 


nZx* - (Sz) 2 ' 

where x and y represent values of the variables in the sample and n 
is the sample size. The formula for a is. 

a — y — hx . 

Since we have washed our previous illustration clean, let us turn 
to a new example. 

Example 13.12 A large mail-order house feels that there is an 
association between the weight of the mail it receives and the vol¬ 
ume of orders to be filled. It desires to investigate the matter and 
to determine the nature of the relationship, if an association does 
exist A knowledge of the number of orders early m the day will be 
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of tremendous help to the organization in planning its daily schedule 
of operations. The need for such planning is apparent when one real¬ 
izes that a mail-order house handles from 1,000 to 4,000 orders in a 
10- to 15- or 20-minute cycle. This means that anywhere from 1,000 
to 4,000 orders are being processed, assembled, packed* and shipped 
through the same operational channels during each cycle. Any over¬ 
lapping of these cycles can result in complete chaos. 

To illustrate the calculations necessary to estimate the regression 
line, let us assume that a sample of 25 observations is selected within 
a range of X-values from 200 to 600 pounds of mail. Five observa¬ 
tions are made for 200 pounds, live for 300 pounds, and so forth. The 
results are as follows: 


Observation 

Weight of Mail 
(Hundreds of Pounds) 

Number of Orders 
(in Thousands) 

1. 

2.0 

6.4 

2. 

2.0 

8.0 

3. . .. 

2.0 

7.2 

4. 

2.0 

7.5 

5. 

2.0 

6.9 

6. 

3.0 

10.9 

7. 

3.0 

10.3 

8. 

3.0 

9.5 

9. 

10. 

3.0 

3.0 

9.7 

10.6 

11. 

4.0 

12.5 

12. 

4.0 

12.9 

13. 

4.0 

14.0 

14. 

4.0 

13.8 

15. 

4.0 

12.8 

16. 

5.0 

16.5 

17. 

5.0 

17.1 

18. 

5.0 

15.4 

19... 

5.0 

16.2 

20. 

5.0 

15.8 

21. 

6.0 

19.0 

22. 

6.0 

19.4 

23. 

6.0 

19.1 

24. 

6.0 

18.5 

25. 

6.0 

20.0 


We first construct a scatter diagram to investigate the nature of 
the relationship. This is illustrated in Figure 13.4. You will observe 
that each of the plotted points in the chart has been identified by a 
number to facilitate references to the original data. The scatter dia¬ 
gram indicates that a straight line may be used to describe the na¬ 
ture of the relationship between the two variables. In Example 
13.13 the procedure for estimating the regression line is shown. 

Example 13.13 The worksheet needed to estimate the regression 
line is arranged as follows: 
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WORK SHEET FOR COMPUTATION OF REGRESSION LINE 


Observation 

Weight of Mail 
(Hundreds of Pounds) 

X 

Number of Orders 
(in Thousands) 

y 

xy 

3 2 


2.0 

6.4 

12.8 

4 

? 

2.0 

8.0 

16.0 

4 

3 

2.0 

7.2 

14.4 

4 

4. 

2.0 

7.5 

15.0 

4 

9.9 

6.0 

19.4 

116.4 

36 

23 

6.0 

19.1 

114.6 

36 

9A 

6.0 

18.5 

111.0 

36 

25. 

6.0 

20.0 

120.0 

36 

Total. 

100.0 

330.0 

1,470.0 

450 


n = 25 Zxy = 1,470.0 

Xx = 100.0 Xx % = 450 

Xy = 330.0 

_ nXxy — XxXy 
b ~ ~nZx i - (Xxf 
_ 25(1,470.0) - (100.0) (330.0) 

25(450) - (100.0) 2 
_ 36,750 - 33,000 
~ 11,250 - 10,000 
3,750 
- 1,250 

= 3.00 thousand orders per 100 pounds 

This value of b means that for each increase of 100 pounds in 
weight we estimate that the number of orders will increase by 

3 , 000 . 

a = y — b$ 

_ Xy _ bXx 
~~ n n 
__ 330 _ 3(100) 

25 25 

= 13.2 - 12 
= 1.20 thousand orders 

Hence the equation of the regression line is 

fX Y .x = 1.20 + 3.00X . 

Since X ranges from 200 to 600 pounds in our sample, this regression 
equation should be used only when the daily weight of mail falls 
within this range. 
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Figure 7 3.4 

RELATIONSHIP BETWEEN NUMBER OF ORDERS AND WEIGHT OF MAIL 



The Least-Squares Method . The regression line that we have 
computed also can be derived by a technique statisticians call the 
least-squares method. For the simple linear-regression model, the 
unbiased estimates of A and B are identical with the valqes we 
would derive by the least-squares method. Historically, the least- 
squares method preceded the development of the concept of un¬ 
biased estimates. The technique was introduced by the French 
mathematician, Adrien Legendre, more than 150 years ago. 

The least-squares method fits a regression line to the data that 
satisfies the.following criterion: The sum of the squared deviations 
about the line is a minimum. 

To determine the values of a and b for the least-squares equation, 
a set of normal equations must be solved. These equations are: 

(1) Xy = na + bXx 

(2) Xxy = aSa; + bXx 2 
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If we were to solve these equations for a and b for the data in Ex¬ 
ample 13.13, we would get the same answers as before. 

The Standard Deviation of Regression . We have stated previ¬ 
ously that the standard deviation of regression in the population 
measures the variability of the F-values about the population re¬ 
gression line. In a practical problem we must estimate the standard 
deviation of regression from the sample. It can be shown mathe¬ 
matically that an unbiased estimator for the variance of regression is 


O'Y-X 2 = 


S(y - mm ) 2 
n — 2 


An estimator for the standard deviation of regression is, therefore, 


<? Y-X 



(y — Mr-x) 2 # 
n — 2 


Example 13.14 We estimate the value of cr Y -x below. Note that 
the estimated number of orders is determined by substituting the ac¬ 
tual weight in the regression equation: 

gr.x = 1.20 + 3.00X . 


WORK SHEET FOR COMPUTATION OF STANDARD ERROR OF ESTIMATE 


Obser¬ 

vation 

Weight of Mail 
(Hundreds of 
Pounds) 

X 

Observed Number 
of Orders 
(in Thousands) 

y 

Estimated 
Number of Orders 
(in Thousands) 

fJrY'X 

y — iiY-x 

(y — ixr-xY 

1 . . . . 

2.0 

6.4 

7.2 

-0.8 

.64 

2 . . . . 

2.0 

8.0 

7.2 

+0.8 

.64 

3. . . . 

2.0 

7.2 

7.2 

0.0 

.00 

4. . . . 

2.0 

7.5 

7.2 

+0.3 

.09 

22 .... 

6.0 

19.4 

19.2 

+0.2 

.04 

23.... 

6.0 

19.1 

19.2 

-0.1 

.01 

24.... 

6.0 

18.5 

19.2 

-0.7 

.49 

25.. . . 

6.0 

20.2 

19.2 

+0.8 

.64 

Total.. 

100.0 

330.0 

330.0 

0.0 

7.52 


<?Y'X 




Mf-x 


n 

IU52 

\ 23 

= .572 thousand orders 


You will observe that the sum of the deviations about the line is 
equal to zero. This always should be so when unbiased estimators are 
used and offers a method of checking the accuracy of calcula- 
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tions. Also note that the sum of the estimated 7-values is equal to 
the sum of the actual values and, hence, the mean of the estimated 
values is equal to the mean of the actual F-values. This provides an¬ 
other check on the calculations. 

13.6 Sampling Error of Estimated Regression T-m? 

We need a regression line to predict the value of a dependent vari¬ 
able on the basis of the value of an independent variable. For ex ¬ 
ample, in the previous illustration the regression line is needed to 
predict the number of orders on the basis of the weight of the mail 
received. This prediction would enable a mail-order house to plan 
its activities for the day. 

Predictions based on a regression line derived from a sample will 
be subject to two sources of error. In the first place, the relationship 
between the two variables may not be perfect. Such errors in pre¬ 
dictions would occur even if the population regression line were 
known. They do not arise because of sampling. The size of this error 
is indicated by the standard deviation of regression. In the second 
place, the regression line is computed from a sample and, hence, 
does not necessarily show the true position of the population regres¬ 
sion line. This section is devoted to a discussion of the nature of the 
sampling errors and their measurement. 

The Sampling Error of h. The slope (b) of an estimated regres- 
sion line is a random variable. Let us, therefore, consider the pattern 
of variability of b. If we were to draw a very large number of sam¬ 
ples of n pairs of observations from a population that meets the as¬ 
sumptions of a simple linear-regression model, and if the values of X 
were the same for each of the samples, then the b values would be 
normally distributed and their mean would be equal to B. That is, 
the expected value of b is B. Furthermore, the pattern of variability 
of these b values could be determined if we knew the standard error 
of b. The formula for the standard error of h is 

_ _ &Y-X 

&b — — prr-r—■ ; . 

v2(x — x) 2 

You will recognize the numerator as the population standard devi¬ 
ation of regression. The denominator is the square root of a sum— 
the sum of the squared deviations of the X -values about their mean. 

The formula reveals three facts about the standard error of the 
slope: (1) The smaller the standard deviation of regression, or the 
more perfect the relationship in the population, the smaller is 
the standard error of the slope. (2) The larger the sample size, the 
smaller is the standard error of the slope. This follows since the 
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larger the sample, the more (x - x) 2 terms are added. This increases 
the denominator and decreases «„■ (3) The greater the variability 
of the x-values, the smaller is the standard error of b. 

Since we do not know the population standard deviation of re- 
gression in any given problem, we must estimate its value from a 
sample. Hence, an estimator for may be written as 

, &Y-X _ . 

(Th — — — 

Vx(x — xf 


The fact that the estimated value of the slope is not zero does not 
mean that a relationship necessarily exists between the variables m 
the population. A nonzero value of b might arise because of sam¬ 
pling error. To know with certainty whether a relationship does or 
does not exist, it would be necessary to know the value of the slope 
in the population. If B were equal to zero, no relationship would 
exist and the regression line would be useless to predict Y on the 
basis of X. If B were not equal to zero, a relationship would exist 
and the regression relationship might be useful. Since population 
data are not available, it is necessary to reach a decision on whether 
or not a relationship exists on the basis of sample data. The decision 



Action 

State of the World 

B = 0 

B ^ 0 

Consider line as 

ViootB frvr nrpnifltlOTl 

Incorrect 

decision 

Correct 

decision 

Correct 

decision 

Incorrect 

decision 

UtitoiO lUi jji CUIvVAVil . ****** 

Discard line as 

Unoio f at* r\Vpniptinri ... « 

DStSlo I or picumtiuu.. 


We shall consider situations in which the sample size is given. In 
such cases it is possible to control only one type of error—the error 
that we are most eager to avoid. This error usually is to use the 
regression line if no relationship exists. Thus, if we set a value for 
the a-risk, we can derive a decision rule for which the probability of 
using the' regression line when no relationship exists is equal to <*. 
As usual, we determine the appropriate value of a on the basis of 
the costs entailed in making an incorrect decision. In the mail-order 
house problem, the consequences of predicting the number of orders 
on the weight of mail, if no relationship between the variables exist, 

must be considered. . . , 

What we seek, therefore, is a decision rule to give us the desired 
protection against the use of the regression line to estimate Y on the 
basis of X if no relationshin exists. The decision rule would have the 
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following form: If in a sample of n items from a bivariate popula¬ 
tion b is greater than b* (the critical value), consider the regression 
line as a basis for prediction; if b is equal to or less than b*, discard 
the regression line as a basis for prediction. 

We have noted that the sampling distribution of b is normal. It 
is, therefore, necessary to determine a value of b* so that the prob¬ 
ability of obtaining values of b equal to or greater that b* is a if B is 
equal to zero. This situation is illustrated below: 



b* = 0 + Zffb 

or 


b* = zcr b . 

Example 13.15 Let us now derive a decision rule to determine 
whether or not to consider the use of the regression equation in Ex¬ 
ample 13.13 to estimate the number of orders on the basis of the 
weight of mail. 

We set an a-risk of .05. We know that 

6* = zcr b 
= lMff b . 

(The normal deviate for an area of .05 in one tail is 1.64.) Now note 
that 


0b 


0YX 


__ cry-x 

TpTB 

\ n 

_ .572. 

Vm 
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Hence, 


b* = (1.64) (.081) 
- .133 . 


The decision rule, therefore, is: If the value of the slope (b), esti¬ 
mated from a sample of 25 from a bivariate population of number of 
orders and weight of mail, is equal to or less than .133, discard the 
regression line 5 if b is greater than .133, do not discard the regres¬ 
sion line. 

Since b is equal to 3.00 (in Example 13.13), we should not discard 
the regression line as a basis for estimating number of orders on the 
basis of weight of mail. 

Even though the decision is that the line should be considered as 
a basis for estimating the dependent variable upon the basis of the 
independent variable, we still must determine whether the esti¬ 
mates based on the regression line will be sufficiently precise to 
meet the requirements of a given problem situation. In terms of the 
mail-order house problem, we must determine whether the precision 
of the estimate of the number of orders, based on the weight of 
mail, is precise enough to make possible effective planning of a 
day’s operations. 

The Standard Error of an Estimated Mean . Since the estimated 
regression line is determined from a sample of observations, the 
estimates of the 7 -intercept (A) and the slope (J5) are subject to 
the possibility of sampling error. Therefore, the estimate of t* Y .x 
which depends upon the estimates of A and B will also be subject 
to the possibility of sampling error. 

For simplicity in notation let y be an estimate of i* Y . x for a given 
X, where y is computed from an estimated regression equation. 
Thus, 


pyx = y = aA-bX . 

We now shall consider the sampling error of this estimate for a 
given X-value. An estimator for the standard error is given by the 
following formula: 

1 1 , (X -*) 2 

where is the standard error of the estimate of the average 7-value 
for a given X-value, denoted by X. 

The first part of the expression under the square root sign re¬ 
flects the error in a, and the second part indicates the error in b. 
You will observe that 05 varies with X. It is at a minimum when 
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X is equal to the mean of the observed X-values. As the deviation 
oi the value of X from x increases in either direction, increases. 

Example 13.16 Let us now determine the standard error for the 
line of regression m the mail-order house problem. We have worked 
out in detail the calculation of the error in the average value of the 
number of orders (y) when the weight of the mail is 500 pounds 

In " 5 Z The err ° rS l° r 0ther values of Z are S iven ™ the table that 
follows the computation. 

a _ A 

(Tg — - T~~ - 

\n^2(x-xy 

The values of &y X and S(z - x) 2 were computed previously as .572 
Therefore^ 6 1313 a " d 6 ° ^ Example 1315 ), respectively. 


<r$ = .572 


1 , (5~ 
25 ^ 50 


= .14 for X = 5 . 

The above result of 14 means that if many, many regression lines 
were computed for the same X-values, then the resulting average 
value of the number of orders for 500 pounds of mail would have a 
standard deviation of .14 thousand orders (140 orders). 

Values of for 
Selected X- Values 


Observe that the value of 6 S is at a minimum when A' is 4 (the 
mean of the X-values) and increases as we deviate from the mean 
m either direction. 

The error in the lme, as we have stated, represents a sampling 
error. As with all sampling errors, the error can be decreased by in¬ 
creasing the sample size. A glance at the formula for will con¬ 
vince you of this. Both denominators, n and X(x — x)\ increase as 
sample size increases and hence & f decreases. 

The Standard Error of an Estimated Y-Value. We have con¬ 
sidered how the standard error for the mean of the F-values for a 
given Z-value can be computed. This error increases as X deviates 
from the average Z-value. It also can be decreased by increasing the 
sample size. This standard error is the result of the use of a sam- 
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pling procedure. It approaches zero if larger and larger samples are 

taken. . . . ,. ., 

We now turn to the standard error of an estimate of an individ¬ 
ual F-value for a given X-value. The standard error of this type of 
estimate is the result of two factors: (1) sampling, and (2) an im¬ 
perfect relationship in the population. Thus, an estimate of an indi¬ 
vidual F-value, even if the population were known, would be sub¬ 
ject so some error—this we know is indicated by the population 
standard deviation of regression. 

Let y be the symbol for an estimate of an individual F-value. 
Such an estimate may be determined from the regression equation. 
That is, a point on the regression line will be used as an estimate. 
Hence, let 

y = a + bX . 

Note that the estimate of an individual F-value is determined from 
the regression equation in exactly the same way that estimates of 
the average value of F for a given X-value are determined. How¬ 
ever, in the first case the standard error will be larger, for we are 
trying to predict an individual rather than an average value. 

Thus, while the standard error of an estimate of an average 
F-value may be estimated as 

* * /rr ix^F 

an estimate of the standard error of an individual F-value is 


A 

<Tj 




1 -I-h 

n 


(X - z) 2 
2(x - x) 2 


As you can see by inspection of these formulas, the latter always 
will be larger since it includes an extra one under the square root. 
Let’s illustrate the difference by a numerical example. 


Example 13.17 Let us continue with the mail-order, problem. 
Suppose that 5 hundred pounds of mail (X = 5) are received on a 
given day. On the average this will mean 


j&y.x = 1.20 + 3.00(5) 

= 16.2 thousand orders. 


We also may estimate that on the particular day F is 16.2 thousand 

orders; that is, y = 16.2. i T 

These estimates are the same, but their standard errors are not. In 

Example 13.16 we found that 

& g = .14 thousand orders 
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e f m J? te °J th ® avera § e T-value. The standard error of the 
individual estimate is, however 


Vy — a Y-x-yJ 
= .572 yj. 


. f 1 I (X- xY 

n 2(x — x) 2 


25 + 50 


= .59 thousand orders. 

taffie^l™ 3 t alUe l° f ,? for other values of x are included in the 
teble below. You should compare them to the corresponding esti¬ 
mates of (7y given m Example 13.16. 1 S 


X 

y 

ov 

2 . 

7.2 

.61 

3. 

10.2 

.59 

4. 

13.2 

.58 

5. 

16.2 

.59 

6 . 

19.2 

.61 


Again you will, notice that the standard error of the estimated 

th'e mea 6 n ofthe . wb , en * = 4 hundred pounds, which is also 

„XdeviaLJ t i UGS “^"P 1 ®- The standard error increases 
as A deviates from 4 in either direction. 

Interpretation of Standard Errors. For sufficiently large samples 
a regression estimate of p. Y . x or of an individual F-value will be 
normally distributed. In such cases the standard errors of these es¬ 
timates permit measurement of risks in the usual way—by refer- 
ence to the normal curve. Thus, the risk that an estimate of ,x 3 ,, 

. w! ,ra tri ! e value by L96tr 5 or more would be .05. The 
isk that it differs by 2.58<r ? or more would be .01. Similarly, an 
estimate of an individual F-value for a given X-value may be eval¬ 
uated by reference to the normal curve if n is large. The risk of an 

0 r 5 O etc qUaI t0 ° r greater than L9K hl SUCh an estimate w ould be 
sand*o£s fo 3 ; X 13 'i 7 We estimated as 16.2 thou- 


We now note that 


~ *14 thousand orders. 


jap = .&( tnousand orders. 


There is a risk of .05 that an estimate of fx Y . x will be in error bv 27 
or more thousand orders (that is, by 270 or more orders). Similar^ 

2.58<r 5 = .36 thousand orders. 
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Hence, there is a risk of .01 that an estimate of fi Y x will be m error 

by 360 or more orders. . . . . , 

For estimates of p Y -x at other levels of X, we find risks of exceed¬ 
ing the following errors: 


.. 


Risk: .05 

Risk: .01 

X 

1.961x5 

2.58<rg 

2 . 

.39 

.52 

3 . 

.27 

.36 

4 . 

.22 

.28 

5 . 

.27 

.36 

6 . 

.39 

.52 





These values are all interpreted in the way indicated above. 

Example 13.19 In Example 13.17 we considered the value of 16.2 
thousand orders as an estimate of an individual Y -value for X — 5. 
In this case, we estimated the standard error as 

= .59 thousand orders. 

Now note that 

lMffy = 1.16 thousand orders. 

Thus, there is a risk of .05 that an estimate of Y will be m error by 
1.16 or more thousand orders. Furthermore, since 

2.58o- y = 1.52 thousand orders, 

there is a .01 risk that an estimate of Y will be in error by 1.52 or 
more thousand orders. 

The errors in the estimate of F-values that would be exceeded 
5 times in 100 and 1 time in a 100 are shown below for some other 
values of X: 


X 

Risk: .05 
1.96£„ 

Risk: .01 
2.58 (Ty 

2 . 

1.20 

1.57 

3. 

1.16 

1.52 

4. 

1.14 

1.516 

5. 

1.16 

1.52 

6 . 

1.20 

1.57 


Compare these possible errors with those given in the last illus¬ 
tration. For a given level of risk and a given X-value, these are 
larger. Why? 

The proper interpretation of the standard errors of estimates of 
average and individual F-values is important because it enables us 
to evaluate the worth of a regression line as a basis for decisions by 
association. 
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tnl^+r pfe i3 ' 2 ° f Let US assume that the mail-order house can 
tolerate an error of up to 2 thousand orders in any day’s esLate 

of orders received. Further, let us suppose that the risk of an Sr 
equal to oi gi eater than 2 thousand orders should be only 05 Does 
the regression equation meet the needs of the mail older hoJse^ 

The answer is yes, if 1.96+ is less than 2 thousand orders for 

?mt kitz ;'sE ts 

.aoo-j, m Example 13.19. All are less than 2 thousand orders The 
estimated regression equation is satisfactory. 

Confidence Intervals. At times it may be preferable to use a 
egression equation to derive an interval estimate of p, Y x or of an 
dividual F-value, rather than compute point estimates. A con¬ 
fidence interval, as we have noted in an earlier chapter is a range 
of values which we hope includes the quantity being estimated It 
is constructed by a procedure which if repeatedly used would have 
a specified probability of including the quantity being estimated 

en C fi°j 1Slder co “P utatlon of an interval estimate for ^ Y . x . A 95 
onfidence interval for X = 5 hundred pounds would be: 

Af.s - 1.96£* to H Y . 5 + 1.96^, 

16.2 - 1.96(.14) to 16.2 + 1.96(.14) 

16.2 — .27 to ]6.2 + .27 

15.93 to 16.47! 

would be I 17 ’ a ■" °° nfidenCe interval for * = 5 hundred pounds 


Mr. 5 — 2.58+ 

16.2 - 2.58(.14) 

16.2 - .36 
15.84 


to fay -5 +2.58+ , 
to 16.2 + 2.58(.14) , 
to 16.2 + .36 , 
to 16.56. 


™^ n QG SampleS ° f f W6re drawn and il,tervals such as this com- 
of the “tervals would include the true value of 

The computation of an interval estimate for an individual 

JrlthTr'tha” ^ * T Ca k ulations - however, we must use 
05 7 L th rather than in the formulas. Thus a 

•95 confidence interval for X = 5 hundred pounds would be: 


y — 1,96+ to 

16.2 - 1.96(.59) to 

16.2 — 1.16 to 

15.04 to 


y + 1.96+, 
16.2 + 1.96(.59) , 
16.2 + 1.16 , 
17.36 


• J h °i °t 4 a,i0n of - 95 confidence intervals for p. rx and for an 
individual F-value at other Z-values is given on page 428 
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X 

M7X 

1.96 &y 

2 . 

7.2 

.39 

3 . 

10.2 

.27 

4 . 

13.2 

.22 

5 . 

16.2 

.27 

6 . 

19.2 

.39 


MF-X 


— 1.96 &y to jxY-X + 1*96 


6.8 to 7.6 

9.9 to 10.5 
13.0 to 13.4 
16.0 to 16.5 
18.8 to 19.6 


X 

y 

1.96(71, 

y — 1.96c v to y + 1 . 960 - 5 , 

2 . 

7.2 

1.20 

6.0 to 8.4 

9.0 to 11.4 

12.1 to 14.3 

15.0 to 17.4 

18.0 to 20.4 

3 . 

10.2 

1.16 

4 . 

13.2 

1.14 

5 . 

16.2 

1.16 

6 . 

19.2 

1.20 


Figure 13.5 presents these results, inoiu uiai ^ — *-—- 

for the estimates of Y is wider than that for the estimates of 
Also observe that both confidence bands are narrowest when , 

the mean of the X-values, and both get wider as the X-values de¬ 
part from this mean. 

13.7 Judgment Applications of Regression 

There are many occasions when a bivariate population of interest 
does not satisfy the conditions of the simple linear-regression model. 
For example, one assumption that is frequently violated is that t 
scatter of the 7-values about the regression line is normal. Another 
is that the standard deviation of regression is constant for all perti¬ 
nent values of X. In these situations the estimation procedures de¬ 
scribed above are still valid, but the risks of errors m the estimates 

usually cannot be precisely determined. 

Another problem arises in situations for which the X-values are 
not fixed for the study. Still another difficulty presents itself when 
the 7-values cannot be assumed to constitute simple random sam¬ 
ples. In such cases none of the statistical theory and methods given 

above applies. . . , , 

This does not mean that a regression line cannot be computed, in 

fact the same formulas can be used. Any prediction based on the 
regression line in these instances, however, cannot be made on the 
basis of the statistical theory discussed above. Predictions based on 
such regression lines must rely on the judgment of the expert using 
this technique. A common application of regression analysis m this 
connection is to forecast sales of a company. Very often a variable 
such as gross national product or disposable personal income is used 
as the independent variable. However, the relationship of the vana- 
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Figure 13.5 

.95 CONFIDENCE INTERVALS FOR AND FOR INDIVIDUAL 
V-VALUES AT VARIOUS VALUES OF X 



bles in the population does not satisfy the assumptions of the simple 
linear-regression model. For example, the X-values in such applica¬ 
tions cannot be fixed for purposes of the study. In addition, the devi¬ 
ations of the annual sales from the line of regression are related to 
each other, casting suspicion on the assumption of randomness. As 
a rule, if sales of one year are high, they will be followed by relatively 
high sales during the next year. 

Example 13.21 This illustration develops the relationship be¬ 
tween the annual sales of the Standard Oil Company of New Jersey 
and the nation’s output of goods and services as measured by gross 
national product. The scatter diagram for these two variables is 
shown in Figure 13.6. It indicates that the relationship between the 
variables may be described by a straight line. The regression line 
computed on page 431 also has been drawn on that chart. 




430 


STATISTICS FOR BUSINESS DECISIONS [Ch. 13 


Figure 7 3.6 

RELATIONSHIP BETWEEN SALES OF STANDARD OIL COMPANY 
OF NEW JERSEY AND GROSS NATIONAL PRODUCT 



GROSS NATIONAL PRODUCT (BILLIONS OF DOLLARS) 


WORK SHEET TO DETERMINE REGRESSION LINE 
SALES OF THE STANDARD OIL COMPANY OF NEW JERSEY 
ON GROSS NATIONAL PRODUCT 


Sales GNP 


Year 

(Billions of Dollars) 

(Billions of Dollars) 





y 

X 

xy 

x 2 

y 2 

1950. . . 

3.8 

285 

1,083.0 

81,225 

14.44 

1951.. . 

4.7 

328 

1,541.6 

107,584 

22.09 

1952. . . 

5.2 

345 

1,794.0 

119,025 

27.04 

1953. . . 

5.5 

363 

1,996.5 

131,769 

30.25 

1954.. . 

5.7 

361 

2,057.7 

130,321 

32.49 

1955. . . 

6.3 

392 

2,469.6 

153,664 

39.69 

1956. . . 

7.1 

415 

2,946.5 

172,225 

50.41 

1957. . . 

7.8 

440 

3,432.0 

193,600 

60.84 

Total... 

46.1 

2,927 

17,320.9 

1,089,413 

277.25 


Sources: 1957 Annual Report, Standard Oil Company of New Jersey, and Survey of Current Business, July, 
1958. 
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b = n2xv ~ 

nSx 2 — (Sx) 2 

= 8(17,320,9) - 134,934.7 
8(1,089,413) - 8,567,329 
= 138,567.2 - 134,9 34.7 
8,715,304 - 8,567,329 
= 3,632.5 
148,075 
= .02453 
a = y - bx 

= 5.763 - (.02453) (365.88) 
= 5.763 - 8.975 
= -3.21 

Hence, the equation of the regression line is 


y = -3.21 + .02453x . 


This equation may be used to estimate sales 

£Lf°, S ® nationaI P rodu ct (x). For example, 
S285 billion, as it was in 1950, we find 


(y) for different levels 
suppose that GNP is 


§ — —3.21 + (.02453) (285) 
= -3.21 + 6.99 
= 13.8 billion. 


To checlj on the fit of the regression line for the period 1950 to 
1957, we may compute values of y for each year and measure the er 
rors m the estimates. A Judgment decision must then be made on the 

e „r?nil a °:r for Mure »«»**“•"«% 

i° gifen bl““ 


Year 

Actual 

Sales 

y 

Estimated 

Sales 

y 

Error 
y - y 

Percentage Error 

*~"xioo 

y 

1950. 

3.8 

4.7 

5.2 

5.5 

5.7 

6.3 

7.1 

7.8 

a s 

0.0 

-0.1 

-0.1 

-0.2 

+0.1 

-0.1 

+0.2 

+0.2 

0.0 

1951. 

0.0 

1952. 

^.O 

K Q 

-2.1 

1953. 

0.0 

K 7 

-1.9 

1954. 

0. 1 

E A 

-3.5 

1955. 

0.0 

A 1 

+ 1.8 

1956 . 

1957 . 

0.4 

6.9 

7.6 

-1.6 

+2.9 

+2.6 

Total | 

46.1 

46.1 

0.0 



nf ! 7 t i hat th ? SUm ° f the estimated ^/-values is equal to the sum 
thp*» + < ^ Ual + va ! ues fl ud that, hence, the sum of the deviations of 
the actual values from the estimated values is zero. These relation¬ 
ships act as a check of the calculations. relation- 
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The computations indicate that in the past, the use of the regres¬ 
sion line to estimate sales would have resulted in errors that range 
from an underestimate of only 2.9 per cent to an overestimate of only 
3.5 per cent. On the average, the percentage error is 2.1 per cent. 11 
the company feels that it can tolerate errors of the magnitude ob¬ 
served, and that the same relationship between the variables will ex¬ 
ist in the future, the line may be adopted as a means of forecasting. 
Thus a forecast made in 1959 for 1960 would have to anticipate the 
value of gross national product for 1960, perhaps on the basis of an 
economist’s judgment. 


In the last example, there would be no point in trying to deter¬ 
mine if a relationship exists by seeing if b differed from B - 0 on 
the basis of statistical theory. The theory does not apply here. Simi¬ 
larly, the standard error of an estimate of sales for a^ given level ol 
GNP need not be computed since we could not attribute any sta¬ 
tistical meaning to it. The risks of such estimates can be evaluated 

only on the basis of judgment. . , 

The decision to use the line for purposes of prediction can depend, 
in part, on its past performance in predicting sales. If this past per¬ 
formance seems satisfactory and there are logical reasons to assume 
that this relationship will continue in the future, it might be de¬ 
cided to use the regression line as a basis for prediction. However, it 
is not possible to measure the risk of an error. Judgment, based on 
experience in the substantive field, and not on statistical theory, 
would be the basis of evaluating the prediction. 

Work is being carried on by statisticians to develop more compli¬ 
cated models than we have used so that problems like the one dis¬ 
cussed above might fall within the realm of statistical theory. Such 
developments, however, are beyond the scope of this text. 


13.8 The Population Coefficient of Correlation 

A regression equation, as we have stated, describes the nature of 
the relationship among variables. There are times, however, when 
we are not interested in the nature of the relationship but in the de¬ 
gree of relationship between the variables. One measure of the de¬ 
gree of relationship is called the coefficient of correlation (to be de¬ 
noted by P , the Greek letter rho ). 

Example 13.22 The personnel director of a corporation wantsto 
decide upon a set of psychological tests to administer to prospective 
candidates for an executive training program. He is to use the re¬ 
sults of these tests in conjunction with other criteria (e.g., education, 
experience, personality, diction) to select candidates. He obviously 
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would prefer tests which are most closely associated with successful 
performance m the training course. He is, thus, interested in the de- 
gree of association between grades on a given test and performance 
lne degree of such association may be measured by the coefficient 
of correlation. Tests which have the highest coefficients of correlation 
with performance are to be selected. Let us now assume that the pop¬ 
ulation regression line is known. The line and some of the points 
about the line are plotted in Figure 13.7. 

Figure 13.7 

RELATION BETWEEN PERFORMANCE RATING AND TEST SCOlh 



If we desired to predict performance of candidates on the basis 
of past performance without reference to any associated variables, 
we might predict that a candidate’s performance rating will be 
equal to the average performance rating in the population (> y ). 
The deviation (error) of an individual performance rating from the 
average could be expressed as ( Y — /x y ) and is represented by line 
AC in the above chart. This deviation (error) can be written as: 

AC = AB + BC 

or 

(Y ~ jxy) — (Y — fi Y -x) + (mf-x — jur) . 

If we square the both sides of the equation, do the same for all points 
in the bivariate population, and then sum all of these squares we 
obtain ? 

2(T — /x F ) 2 = 2(F — /x f*x) 2 -f S(ju F -x — Mf) 2 . 

Let us examine these three expressions. 

The term 2 (F — /x y ) 2 is a measure of the variability of the F-val- 
ues about their mean and is called the total sum of the squares. 
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The term S(mx,x - Mr ) 2 is the sum of the squared deviations of 
the values of Pr -x about their mean. This term is a measure of the 
explained variability since .it measures the variability of the /-'-r.x 
values which are determined on the basis of the regression line. It is 
referred to as the explained sum of the squares. 

Finally, the term S(7 — Pyx)' is a measure of the unexplained 
variability since it measures the difference between the observed 
Y- values and the value of Y we obtain on the basis of the regression 
relationship. It is called, the unexplained sum of the squares. 

We, therefore, can state that 

Total Sum of the Squares = 

Explained Sum of the Squares + Unexplained Sum of the Squares. 

We define P 2 (the square of the coefficient of correlation) as fol- 
lows: 

Explained Sum of Squares 
p ^ Total Sum of Squares 


or 


Unexplained Su m of Squares 
p2 “ 1 Total Sum of Squares 

Symbolically, we may write 


2 _ — Mr) 2 

p ~ 2(7 - py ) 2 


or 


P 2 = 


2(7 - ixvxf 
2(F - hyY 


In terms of our last illustration, p 2 represents the proportion 
of the variability in the population of performance ratings that is 
explained by variability in test scores. t 

If there were no relationship at all between the variables, Y 
would have the same average value for every X. In other words, 
, JlY ' X = ix Y and, hence, the explained sum of the squares would be 
equal to 0. Therefore, p 2 also would be 0. In other words, none of 
the variability in F would be explained by X. We would be just as 
as well off predicting a F-value on the basis of the mean of all 
the F-values (py). 

If the relationship were perfect, then the n Y -x values would be 
identical with the observed values. The explained variability would 
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equal the total variability, and P 2 would equal 1. That is, all the 
variability in Y would be accounted for by its relationship to X. 

. Hence > p 2 can range from 0 (no relationship) to 1 (perfect rela¬ 
tionship). Its square root, the coefficient of correlation, P) can be 
either positive or negative and, hence, can range from — 1 to +1. If 
the relationship of X and Y is direct—that is, as one increases the 
other increases the sign of p is positive. If the relationship is in¬ 
verse, the sign of p is negative. Thus, the coefficient of correlation 
and the slope of the regression relationship (J9) have the same sign. 

By convention, we refer to the degree of relationship as the coef¬ 
ficient of correlation ( P ). However, it is simpler to express its mean¬ 
ing in terms of P 2 , which also is known as the coefficient of determi¬ 
nation 

In the above discussion we have assumed that the bivariate popu¬ 
lation was known. In practical situations, we must estimate the co¬ 
efficient of correlation from a sample. In order to use statistical 
theory, we must make assumptions about the population. The sim¬ 
ple linear-regression model is not a satisfactory model for this pur¬ 
pose. It is necessary to assume a more elaborate model which is 
known as the bivariate normal distribution . 

The assumptions of this correlation model may be summarized as 
follows: 

1. The mean of the F-values for each given X {^ Y . x ) is on a re¬ 
gression line: 

jUY-x = A +BX. 

2. The mean of the X-values for each given Y (p z . Y ) is on an¬ 
other regression line: 

»x-y = A'+ B'Y . 

(The two regression equations will be the same only if P 2 equals 1.) 

3. The F-values for every X-value are normally distributed 
about the regression line with constant variance (<r r . x 2 ). The 
X-values for each F-value are also normally distributed about the 
regression line with constant variance (o- x . r 2 ). 

In addition, this correlation model, unlike the simple linear-re¬ 
gression model, does not assume that the X-values are fixed (error- 
free). Therefore, the sampling schemes differ. In the regression 
model, sample F-values are observed for various values of X—these 
values of X are fixed by the investigator. In the correlation model, 
pairs of X- and F-values are observed on a sample of elementary 
units selected at random. 
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13.9 Sampling Theory for Correlation 

In some situations a decision must be made as to whether or not 
association between two variables exists; that is, as to whether or 
not p = 0. Let us see how this decision can be made on the basis of 
a simple random sample of observations if the assumptions of the 
model described above hold. 

First of all, we note that the sample coefficient of correlation is 
the pertinent statistic. It will be denoted by r, and it represents the 
degree of association in a sample. A simple formula for computing 
the value of r for a given sample is 

_ riZxy — XxZy _ 

V V[n2x 2 — (2z) 2 ] [nZy 2 — (%y) 2 ] 

Let us first consider the computation of r from this formula. 

Example 13.23 Two crews are making fuel tanks. The propor¬ 
tion of defective tanks varies from day to day for both crews. Man¬ 
agement wishes to determine whether the same causes affect the op¬ 
eration of both crews. They reason that if the proportions of defec¬ 
tives for both crews are positively associated with each other from 
day to day, then common causes are most likely affecting their work. 
If there is no correlation, then most likely different factors are affect¬ 
ing the work of each crew. 

Let us assume the following sample data are given. A small sample 
is used to simplify the arithmetic. 


WORK SHEET FOR COMPUTING VALUE OF r 
Percentage of Tanks with Leaks 


Day of Month 

Crew A 
(*) 

Crew B 

(y) 

xy 

x 2 

V 2 

1 . 

8 

25 

200 

64 

625 

2 . 

6 

11 

66 

36 

121 

3. 

6 

1 

6 

36 

1 

4. 

9 

3 

27 

81 

9 

5. 

1 

10 

10 

1 

100 

Total. 

30 

50 

309 

218 

856 


riLxy — 'LxZy _ 

~~ V^Sa; 2 - (Sx) 2 ] [nZy 2 - (?y) 2 ] 

5(309) - 30(50)_ 

~ V[5(218)^-~(30) 2 ] [5(856) - (50) 2 J 

_ 45 

~ VXl90) (1,750) 

- .08 
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The value of r provides an estimate of p. Thus, we may write, 
for Example 13.23, 


p = .08 . 

This can be taken to mean that p 2 is equal to about .0064. Since p 2 is 
the ratio of explained variation to total variation, we thus estimate 
that: Less than 1 per cent of the variation in the percentages of de¬ 
fective tanks made by crew A is associated with the percentages of 
defective tanks made by crew B. 

Of course, since r is a random variable, the above estimate of p is 
subject to the risk of sampling error. The measurement of this er¬ 
ror, however, is beyond our survey of the matter. 

In any event, the decision problem formulated in the illustration 
is not one of estimation. It is one of deciding whether p is equal to 
or less than 0 against the alternative that p is greater than 0 (i.e., 
that it is positive). Certainly a value of r equal to .08 might have 
come from a population for which p = 0. 

Let us, therefore, determine a statistical decision rule to decide 
the problem. We consider only the case for which the sample size 
is fixed. In this case only one risk, the a-risk, need be set. Let a be 
the risk of assuming some positive association when actually p = 0. 
The table below gives critical values of r for a = .05 and a:= .01 
for different sample sizes. If the value of a sample r is equal to or 
less than the critical value of r given in the table, we should assume 
that no positive relationship exists. If the sample r is greater than 
the critical value, we should assume that the variables are posi¬ 
tively correlated. 


CRITICAL VALUES OF THE 
CORRELATION COEFFICIENT 


Sample Size 

a-Risk 

n 

.05 

.01 

5 

.81 

.93 

10 

.55 

.72 

20 

.38 

.52 

50 

.23 

.32 

100 

.16 

.23 


Let us assume that in Example 13.23 management is willing to 
assume an a-risk of .05. That is, they are willing to run the risk of 
assuming that correlation exists when actually there is none 5 times 
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in 100. Since r = .08 is less than the critical value of r (.81) for n — 
5, they should conclude that there is no relationship between the 
proportion of defective fuel tanks produced by the two crews. 
Hence, they should conclude that different cause systems are af¬ 
fecting the work of the two groups, and they should act accord¬ 
ingly. 

13.10 Rank Correlation 

To apply the statistical decision rule covered in the last section 
properly, one must assume a bivariate normal population. If no such 
assumption is warranted, a rule based on an alternative statistic may 
be formulated. This statistic is called the coefficient of rank correla¬ 
tion. Its use requires that we reach a decision, not on the basis of 
the degree of association between actual observations shown in a 
sample but rather on the basis of the association between the ranks 
of the observations. Let us consider the following illustrative situ¬ 
ation. 

Example 1S.24 It has been suggested to a sales executive that 
greater technical knowledge of the company’s product by salesmen 
would increase sales and, hence, that it would be advisable to intro¬ 
duce a short training program for salesmen. As a test of the plausi¬ 
bility of this suggestion, he decides to investigate whether salesmen 
with greater technical knowledge actually do sell more. 

Ten salesmen have been selling the product for the past 6 months. 
These salesmen are given a test to measure their technical knowl¬ 
edge. The test grades are to be compared with data on the extent to 
which the salesmen met, exceeded, or fell short of their sales quotas. 
The data are as follows: 


Salesman 

Test Score 
(Per Cent) 

Percentage of 
Quota Met 

A. 

63 

128 

B. 

82 

75 

C. 

96 

150 

D. 

57 

85 

E. 

82 

115 

F. 

90 

110 

G. 

50 

100 

H. 

70 

104 

I. 

74 

100 

J. 

87 

90 


These 10 bivariate observations will be the basis for a decision on 
the plausibility of the claim. A positive association would indicate 
that there is a degree of validity to the claim that a greater technical 
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knowledge by salesmen enhances sales. No relationship, or a nega¬ 
tive relationship, would indicate a lack of validity. 

If the coefficient of rank correlation is to be used as the pertinent 
statistic, it is first necessary to rank the observations for each vari¬ 
able. The actual observations given above are ranked in descending 
order in the work sheet that follows: 


WORK SHEET FOR COMPUTATION OF COEFFICIENT 
OF RANK CORRELATION 


Salesman 

Test Score Rank 

Percentage of Quota 
Rank 

d 

. » 

d 2 

A. 

8 

2 

6 

36 

B. 

4.5 

10 

-5.5 

30.25 

C. 

1 

1 

0 

0 

D. 

9 

9 

0 

0 

E. 

4.5 

3 

1.5 

2.25 

F. 

2 

4 

-2 

4 

G. 

10 

6.5 

3.5 

12.25 

H. 

7 

5 

2 

4 

I. 

6 

6.5 

-0.5 

0.25 

J. 

3 

8 

-5 

25 

Total. 



0 

114.00 


Observe that salesman C with the highest test score (96) has a 
rank of 1; salesman F with a score of 90 has a rank of 2, and so on. 
Note, however, that there is a tie between salesman B and E for 
fourth and fifth place. Each salesman is, therefore, given a rank of 
4.5—the value midway between 4 and 5. The observations for the per 
cent of quota met are ranked in a similar fashion. Again note that 
salesman G and I are tied for sixth and seventh place with 100 per 
cent. Each salesman is given a rank of 6.5—the value midway be¬ 
tween 6 and 7. 

The value of the coefficient of rank correlation (designated as r f ) 
is given by the following formula: 


n(n 2 — 1) ’ 

where d is the difference between ranks, and n, the sample size. The 
values of d and cP for this example are computed in the work sheet 
given above. The value of r' for the relationship between technical 
knowledge and sales performance is 

/ = 1 _ 6 ( 114 ) 

10(100 - 1 ) 

_ 1 684 

990 
= 1 - .69 
= .31 . 
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The coefficient of rank correlation is a statistic and is subject to 
sampling error. Its value of .31 in the last example does not nec¬ 
essarily imply positive association in the population. A statistical 
decision rule to provide a basis for the decision on assuming or not 
assuming positive association must be formulated. To formulate 
such a rule, an a-risk must be fixed where a is the risk of assuming 
a positive association if actually there is none. The table below 
gives critical values of C for a — .025, for some sample sizes. If the 
value of a sample f is equal to or less than the critical value of 
r' given in the table, we should assume that no positive relationship 
exists. If the sample r' is greater than the critical value, we should 
assume that the variables are positively correlated. 

CRITICAL VALUES OF THE COEFFICIENT 
OF RANK CORRELATION 


Sample Size a-Risk 

n .025 

10.65 

15.43 

20.37 

25.34 

30.31 

50.23 

100.16 


Let us assume that an a-risk of .025 is assumed for the problem 
described in Example 13.24. This is the risk of assuming a positive 
relationship when actually no association exists. Since r* is .31 and 
this is less than the critical value (.65), for n— 10, no positive 
association should be assumed. The decision would be not to intro¬ 
duce the specialized training program at this time. 

As noted earlier, the advantage of using the coefficient of rank 
correlation rather than r itself as the criterion for decision is that 
no assumption of a bivariate normal population need be made. 
Since this assumption of normality often cannot be made in a prac¬ 
tical problem, rank correlation has wide applicability. However, 
there are disadvantages to its use. For one thing, the use of r' rather 
than r usually will involve greater /3-risks. Secondly, the use of 
r r in a decision rule actually results in a test of whether or not 
there is any rank correlation in the population. Strictly speaking, 
it does not provide a basis for deciding whether or not p = 0. This 
is important because a high coefficient of rank correlation does not 
necessarily indicate a high coefficient of correlation. The rank meas- 
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ure takes only the ranks of the observations and not their magni¬ 
tudes into consideration. 

13.11 You Should Now Know That 

Decisions by association involve decisions made on the basis of 
the relationship between associated variables. 

There are two aspects of decisions of association: (1) regression 
—which relates to the nature of the relationship, and (2) correla¬ 
tion—which relates to the degree of the relationship. 

A simple regression relationship is one between two variables— 
a dependent variable and one independent variable. 

Multiple regression relationships involve a dependent variable 
and more than one independent variable. 

Regression relationships also may be linear or nonlinear, de¬ 
pending on whether or not a straight line describes the relation¬ 
ship. 

Scatter diagrams are useful graphic devices for investigating the 
nature of the relationship between two variables. 

Association does not necessarily imply causation. 

One useful statistical model in many regression problems is the 
simple linear-regression model. 

There are two sources of errors in predictions based upon regres¬ 
sion estimates: (1) errors due to an imperfect relationship, and 
(2) errors due to sampling. 

The errors due to an imperfect relationship in the population are 
measured by the standard deviation of regression. 

To interpret any regression estimate properly, one must consider 
the standard error of that estimate. 

In some correlation problems, a bivariate normal population may 
be assumed. 

In such a case, the coefficient of correlation, p, is a measure of the 
degree of relationship. 

The square of this measure is called the coefficient of determina¬ 
tion, p 2 , and may be interpreted as the proportion of the variability 
in a dependent variable which is explained by its association with 
the independent variable. 

The sample correlation coefficient, r, is a statistic which may be 
used as a basis for deciding whether or not p is zero. 

An alternative test statistic, the coefficient of rank correlation, may 
be used as a basis for deciding whether or not association in a popula- 
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tion exists even if no assumption of normality in the population is 
warranted. 

13.12 You Should Now Be Able to Solve 

1. In each of the following situations would a regression model 
or a correlation model be applicable? Explain each of your answers. 

a) A study is to be conducted to determine the relationship between 
gasoline consumption and the speed of an automobile. 

b ) A study is to be conducted to determine the relationship between 
high school grades and college grades. 

c) A study is required in order to determine the relationship between 
the w T eight of a plane and the distance required for take-off. 

d ) The relationship between education of employees and rate of turn¬ 
over on a given job is to be studied. 

2. In each of the following situations, discuss whether or not a 
cause-and-effect relationship exists between the variables. Explain 
your answers. 

a) A positive relationship between net sales/assets ratios and net 
sales/inventory ratios for various companies in the same industry. 

b) The direct relationship, over time, between the annual real estate 
tax rate and the total value of real estate in a community. 

c ) The direct relationship between heights of school children and their 
reading abilities. 

d) The direct relationship between the total liabilities of a company 
and the price of its common stock over a period of years. 

3. For each of the following sets of data draw a scatter diagram 
and discuss whether the relationship appears to be linear or curvi¬ 
linear. 


(a) 

Speed of Automobile 


(Miles per Hour): 10 

20 30 40 50 

60 

70 

80 

90 


Distance Necessary to 

Stop after Signal (Feet): 16 

42 78 124 180 

(b) 

246 

322 

408 

504 


Hardness of 

Synthetic Rubber: 45 55 

61 56 66 71 

86 

48 

62 

80 


Abrasion Loss of 

Synthetic Rubber: 322 286 

Experience with Assembly 

251 284 246 187 

(c) 

130 

310 

241 

165 


Operation (Days): 5 

5 5 10 

10 

10 

15 

15 

15 

Time Taken to Perform 

an Assembly (Minutes): 36.2 

41.8 35.7 22.8 

19.3 

24.9 

17.0 

17.5 

16.8 
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4. A home laundry service desires to determine the relationship 
between age of trucks and annual repair costs. It will use the rela¬ 
tionship to estimate repair costs, and to determine a replacement 
policy for its fleet of trucks. The following data are assembled: 


Truck 

Age 

(Years) 

Annual Repair Cost 
(in Dollars) 

1. 

1 

22 

2. 

1 

75 

3. 

1 

104 

4. 

1 

0 

5. 

1 

0 

6. 

2 

37 

7. 

2 

69 

8. 

2 

81 

9. 

2 

148 

10. 

2 

46 

11. 

3 

105 

12. 

3 

102 

13. 

3 

188 

14. 

3 

65 

15. 

3 

36 

16. 

4 

108 

17. 

4 

63 

18. 

4 

235 

19. 

4 

214 

20. 

4 

127 

21. 

5 

132 

22. 

5 

117 

23. 

5 

174 

24. 

5 

127 

25. 

5 

153 


a) Plot a scatter diagram for these data. 

b) Estimate a regression line to describe the relationship between an¬ 
nual repair cost and age of truck. 

c ) Discuss the extent to which the assumptions of the simple linear- 
regression model are valid for this problem. 

d) Should it be concluded that a direct linear relationship exists be¬ 
tween the variables? Explain your reasoning. 

e) Would you recommend the relationship as a means of estimating 
repair costs? Explain. 

5. Many states employ the assessed valuation of real property 
as a criterion of the ability of units of local government to finance 
their own operations. As a check on the adequacy of this criterion, 
one state wishes to compare the per capita assessed valuation of gov- 
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ernmental units with their median family income. A sample of 25 
districts is selected, and the results are as follows: 



Per Capita 

Median 


Assessed 

Family 

District 

Value 

Income 


(in Dollars) 

(in Dollars) 

1. 

2,600 

3,570 

2. 

2,810 

3,680 

3. 

2,820 

4,420 

4. 

2,800 

4,070 

5. 

2,540 

3,870 

6. 

3,020 

3,940 

7. 

2,870 

4,290 

8. 

2,710 

4,040 

9. 

2,680 

3,550 

10. 

2,650 

3,320 

11. 

2,540 

3,560 

12. 

2,610 

3,510 

13. 

2,760 

3,900 

14. 

2,820 

4,320 

15. 

2,630 

3,790 

16. 

2,460 

3,570 

17. 

2,920 

4,350 

18. 

2,610 

3,920 

19. 

2,600 

3,790 

20. 

2,480 

3,510 

21. 

2,860 

4,050 

22. 

3,150 

4,520 

23 ...:. 

2,440 

3,580 

24. 

2,930 

3,420 

25. 

2,620 

3,600 


a) Estimate the population coefficient of correlation and interpret 
your result. 

b) Discuss the extent to which the assumptions of the bivariate nor¬ 
mal population model are valid for this problem. 

c) Should it be concluded that some positive association exists be¬ 
tween the variables? Explain your reasoning. 

6 . For the problem situation given in Question 5, formulate a 
statistical decision rule in terms of the coefficient of rank correla¬ 
tion. Apply the decision rule. Why do the values of the coefficient of 
correlation and the coefficient of rank correlation computed for this 
and the last question differ? 

7. The following are data for disposable personal income and 
consumer expenditures for furniture from 1947 through 1957: 
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Year 

Expenditures for 
Furniture 
(Billions of 
Dollars) 

Disposable 
Personal In¬ 
come (Billions 
of Dollars) 

1947. 

2.55 

170 

1948. 

2.78 

189 

1949. 

2.69 

190 

1950. 

3.07 

208 

1951. 

3.18 

228 

1952. 

3.39 

239 

1953. 

3.58 

252 

1954. 

3.64 

257 

1955. 

4.18 

274 

1956. 

4.43 

290 

1957. 

4.34 

305 


a) Plot the scatter diagram for the data. 

b ) Are the assumptions of the simple linear-regression model satisfied 
in this problem? Explain why or why not. 

c) Determine a regression equation. 

d) How can you determine the accuracy of this relationship? 

13.13 You Will Also Find That 

The topics which we have considered in this chapter—including 
correlation and regression in general, the simple linear-regression 
model, standard errors, the coefficient of correlation, and so forth— 
also are discussed in the following textbooks: 

Hirsch, Werner Z. Introduction to Modern Statistics , pp. 247-80. New 
York: Macmillan Co., 1957. 

McCarthy, Phillip J. Introduction to Statistical Reasoning, pp. 332-88. 

New York: McGraw-Hill Book Co., Inc., 1957. 

Sprowls, R. Clay. Elementary Statistics, pp. 222-59. New York: 
McGraw-Hill Book Co., Inc., 1955. 

Wallis, W. Allen, and Roberts, Harry V. Statistics: A New Ap¬ 
proach, pp. 524-56. Glencoe, III: The Free Press, 1956. 

















CHAPTER ■ 14 


Forecasting with Time Series 


14.1 Some Remarks on Forecasting 

In this chapter we shall concern ourselves with some aspects of 
business forecasting. Forecasting problems arise when decisions must 
be made concerning future populations. But since future populations 
cannot be studied directly, forecasting involves the methods and 
theory of making decisions for the future on the basis of a study of 
past and present populations. 

We already have considered such problems in previous chapters. 
Thus, in quality control, the pattern of process variability in the 
future is predicted on the assumption that a process remains in con¬ 
trol. Then, again, in decisions by association, predictions are made 
about the future behavior of a variable on the basis of its present re¬ 
lationship with an associated variable. 

In each of these instances predictions are statistical forecasts. 
However, it is impossible to forecast the future precisely—there al¬ 
ways must be some range of error allowed for in the forecast. Sta¬ 
tistical forecasts are those for which we can use the mathematical 
theory of probability to measure the risks of errors in predictions. 
To apply statistical theory for prediction, however, we had to make 
certain assumptions. To use the quality control chart for purposes 
of prediction, we assumed that the process was a random process— 
operating in statistical control. In order to use the regression line to 
predict, we had to assume that the population satisfied the regression 
model. 

There are many forecasting problems that businessmen face for 
which the populations of interest do not satisfy the conditions neces¬ 
sary for the application of statistical theory. In these situations, al- 
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though the techniques used may be the same, the risks of an incor¬ 
rect forecast must be evaluated on the basis of expert judgment. 
The decision procedure then is only partly statistical. We already 
have illustrated this point with reference to the judgment applica¬ 
tions of regression relationships. 

Unfortunately, forecasting is a necessary evil. It involves future 
planning by businessmen who must continually determine their 
needs and demands for tomorrow, next week, and next year. A busi¬ 
ness may face decisions of whether or not to increase production 
during the second half, of what sales quotas should be set for the 
first quarter, and so on. To ignore the existence of these problems 
and to refuse to forecast is a rather naive approach. “No forecasting” 
is, in itself, actually a forecast that business conditions will not 
change. 

The remainder of this chapter will be devoted to a discussion of 
time series analysis and its use in forecasting. Unfortunately, there 
is as yet no complete body of statistical theory developed for this 
subject. Therefore, the application of the methods to be discussed 
to forecasting problems will depend to a large extent on expert judg¬ 
ment. However, the statistician by pointing out the limitations of 
these methods can be of assistance to the expert in the substantive 
field where such techniques will be applied. The methods reduce the 
area within which judgment must operate and thus sharpen the 
analysis. 

14.2 Time Series and Forecasting 

A time series consists of a set of observations made at different 
periods of time. We already have encountered time series in our 
study of control charts. A control chart presents sample means, 
ranges, proportions, or standard deviations observed at uniform in¬ 
tervals of time. 

Example 14.1 Two control charts are illustrated on the following 
page. The first chart (a) is the control chart for the average shearing, 
strength of spot welds. This chart is similar to those presented in 
Chapter 11. 

Chart (b) is a control chart for the average length of steel bars. 
This control chart presents a time series which differs from those we 
have considered heretofore. It represents a condition in which -the 
tool used to cut the bars is subject to rapid wear. 
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a) CONTROL CHART FOR MEAN SHEARING STRENGTH 
OF SPOT WELDS 



b) CONTROL CHART FOR MEAN LENGTH OF STEEL BARS 



Chart (a) in the preceding example presents a very simple time 
series. The population (process) mean remains constant from one 
instant of time to the next. The individual sample means are dis¬ 
tributed randomly about the population mean. The deviations about 
the population mean are independent of each other. That is, our 
knowledge of the fact that one sample mean is above the population 
mean in no way enables us to predict whether the succeeding sample 
means will be above or below the population mean. 

Chart (b) represents a more complicated time series. Here, be¬ 
cause of tool wear, the population average is shifting from one day 
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to the next. However, the population mean shifts in a systematic 
manner—3 ten-thousandths of an inch per day. In other words, the 
shift in the population mean is associated with time, and the line of 
regression of the population mean on time has a slope of .0003 inches 
per day. Thus, part of the upward drift in the sample averages is ex¬ 
plained by tool wear from day to day. The difference between the 
sample mean observed and the population mean for that day, how¬ 
ever, is a random event, and again the fact that the sample for one 
day will be above or below the population mean gives us no clue as 
to the nature of the deviation for succeeding or preceding days. We 
can say that the time series in chart (b) is composed of two ele¬ 
ments: (1) a systematic element represented by the regression line 
showing the day-to-day change in the population average that is 
explained by tool wear, and (2) a random element due to a host of 
chance factors. 

Most time series we encounter in business data, outside the field 
of quality control, are much more complicated than the time series 
illustrated in Example 14.1. Furthermore, in most time series the 
deviations of the observations about the population means are not 
independent but related to each other. When the values of the ob¬ 
servations that follow each other or are near each other are corre¬ 
lated, we refer to this type of correlation as serial correlation. The 
presence of serial correlation makes it impossible to use the methods 
described earlier in this text in order to forecast the pattern of future 
behavior of a series on the basis of its past behavior. 

Methods beyond the scope of this book do exist for estimating the 
degree of serial correlation in time series data. Similarly it is possible 
to measure the standard error of an estimate derived from data that 
are serially correlated, if the degree of correlation is known. Much 
work is being done currently on the subject of time series in the field 
of theoretical statistics. To date, no body of techniques that can be 
widely applied to practical business problems has been developed. 
One of the great obstacles has been the fact that the newly devised 
techniques require long series of computations. Perhaps the develop¬ 
ment of electronic computers will make such techniques feasible in 
the future. Until then, it will be necessary to put heavy reliance on 
judgment to forecast the pattern of behavior of future events. 

14.3 Graphic Presentation of Time Series 

In working with time series it is desirable to present the data 
graphically. Such presentations assist our analysis of time series 
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much as a scatter diagram did in the case of a regression problem. A 
graph simplifies the study of the movements of the data. 

In visualizing changes in time series we are interested in two types 
of comparisons: absolute changes and relative changes. 

Example 14.2 Consider the following figures of consumer install¬ 
ment credit: 

1945—$ 5.5 billion 
1950—$14.5 billion 
1955—$27.8 billion 

During the 5-year period 1945-50, consumer installment credit in¬ 
creased $9.0 billion. During the 5-year period 1950-55, the increase 
was $13.3 billion. Thus, the absolute increase (in dollars) was 
greater during the second 5-year period than during the first. 

The relative increase between 1945 and 1950 was approximately 
164 per cent. Between 1950 and 1955 the relative change was approx¬ 
imately 92 per cent. Thus, although the absolute change was larger 
during the second 5-year period, the relative change was smaller. 

To get a visual impression of absolute changes in data, uniformly 
spaced or arithmetically scaled graph paper is used. We could not 
determine relative changes from time period to time period from 
such charts (without resorting to computations). Relative changes, 
however, can be portrayed visually on semilogarithmic charts (al¬ 
though only the absolute values of the data need to be plotted). 
These charts are ruled with an arithmetic scale on the X-axis and a 
logarithmic scale on the F-axis—hence, the term “semilogarithmic.” 
A logarithmic scale is ruled in such a manner that when we plot a 
value, we actually are plotting its logarithm. The scale is ruled in 
the same way as the scales used for multiplication and division on a 
slide rule. Therefore, when data are plotted on a logarithmic scale, 
equal vertical distances represent equal relative changes rather 
than equal absolute changes. 

Example 14.S Figure 14.1 presents the data of consumer install¬ 
ment credit according to both arithmetic and logarithmic scales. 

On the arithmetic scale chart the vertical distance (d\) between 
the two points A and B is smaller than the vertical distance (d 2 ) 
between points B and C. This is so because the absolute change be¬ 
tween 1945 and 1950 is smaller than the absolute change between 
1950 and 1955. Arithmetic scale graphs picture absolute changes more 
clearly. 

On the semilogarithmic chart the vertical distance (di) between 
A and B is greater than the vertical distance (d 2 ) between B and C. 
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CONSUMER INSTALLMENT CREDIT IN THE UNITED STATES 
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is BTPntPT. +/ + f K 4 t ^ a , t . the ^ eIatlv e change between 1945 and 1950 
,L gre l ter . than the relative change between 1950 and 1955 Semi- 
logarithmic graphs depict relative changes more clearly. 

. Logarithmic scales are ruled as in Figure 14.2. The scale may be 
single tiered as m (a), double tiered as in (c), or multiple tiered, 
mce equal vertical distances represent equal ratios, the distance 
rom 1 to 2 is the same as the distance from 2 to 4, and from 3 to 6 
and so on. For the same reason the distance from 1 to 2 (ratio 2:1) 
is greater than the distance from 2 to 3 (ratio 3:2 or 1.5 to 1). The 
greater the relative change, the greater is the vertical distance. 

Example14.4 Figure 14.2 presents five illustrations of logarith¬ 
mic scales. Scale (a) illustrates the manner in which most types of 

qcXTlT- Y prepar , ed , Lganthmic scales appear when purchased. 

calc (6) is prepared for data which range from $10 to $100 You 
wdl notme that since number 1 on the scale is set at $10, number 2 
must be designated as $20. Thus, 3 is $30, and so on. 

Scalp (c) is arranged for a two-tiered semilogarithmic chart of 
data that vary between 10 and 1,000 tons. Scale (d) is prepared for 
data that range from 700 to 7,000 employees. Scale (e) may be used 
for data which range from $.50 to $50. 

You will observe that on each of these scales equal vertical dis¬ 
tances represent equal ratios. 

Semilogarithmic charts also are used at times to present data 
which vary widely in absolute amounts. At other times we may de¬ 
sire to present two types of data where the magnitudes of one series 
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vary widely from those of the second. Here again a logarithmic chart 

facilitates graphic comparisons. _ 

Where semilogarithmic charts are used in reports, care must be 


Figure 14.2 

ILLUSTRATIONS OF LOGARITHMIC SCALES 
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taken to explain that equal vertical distances on the chart represent 
equal ratios, or equal relative changes. Otherwise, such charts may 
be misinterpreted by readers who are not acquainted with this form 
of graphic presentation. 
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14.4 Nature of Economic Time Series 

The control chart for mean length of steel bars in Example 14.1 
presents a time series composed of a systematic element and a ran¬ 
dom element. The systematic element causes the shift in the popu¬ 
lation mean from one period to the next, while the random element 
accounts for the variability about that mean. Economic time series 
also can be described in such terms. However, the systematic 
element in economic time series is much more complicated than in 
the quality control illustration. There, the systematic element re¬ 
flected the influence of a single factor—tool wear. Economists iden¬ 
tify three types of systematic elements that influence the behavior 
of economic time series: (1) trend, (2) seasonal, and (3) cyclical. 
Each of these factors is the result of different forces affecting the 
data. In addition to the three systematic elements, there is a random 
element. 

The trend of a time series is the resultant of the long-time, slowly 
moving forces affecting the data. Such forces include population 
growth, technological changes, changes in habits and customs, and 
so on. Trends can be described qualitatively as moving upward, 
downward, or horizontally (no trend). 

Figure 14.3 includes three economic time series. You will observe 
that each series reflects a gradual long-term movement-—trend. 
Thus, while the growth of the population of the United States re¬ 
flects a continuous upward trend, farm population reflects a down¬ 
ward trend. The index of industrial production also reflects an up¬ 
ward trend. 


Figure 14.3 

o) POPULATION OF THE UNITED STATES, 1790-1950 
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b) FARM POPULATION IN THE UNITED STATES, 1910-55 



Charts (b) and (c) indicate that in addition to the long-term 
trend, there are oscillatory movements. Thus, farm population hit a 
low in 1930, increased in 1935, and then continued to decrease again. 
The increase in farm population during the thirties resulted from 
the lack of other opportunities for employment because of depressed 


c) INDEX OF INDUSTRIAL PRODUCTION, 1919-57 
{1947-49 - 100) 




Oh. 14] FORECASTING WITH TIME SERIES 455 

business conditions. The oscillatory movements from peaks to 
troughs and back to peaks in the Index of Industrial Production also 
reflect the influences of general business activity. Thus, the business 
recessions of 1921, 1924, 1927, 1932, 1938, 1949, 1954, and 1957 are 
clearly indicated. Such oscillatory movements, of more than one 
year in duration, that are related to ups and downs in general busi¬ 
ness are designated as cycles. You will observe that not all up and 
down movements are cycles. Thus, the peak in 1944 and the trough 
in 1946 are the result of the sharp increase in industrial production 
due to the war and the subsequent return to peacetime conditions. 


Figure 14.4 

INDEX OF DEPARTMENT STORE SALES IN THE UNITED STATES, 1955-58 

(1947-49 = 100) 



Figure 14.4 presents department store sales for the period 1955-58. 
The trend of this time series is slightly upward. There also is a cycli¬ 
cal peak in 1956 and a trough in 1957. The trend and cyclical move¬ 
ments, however, are obscured by the sharp up and down swings dur¬ 
ing the course of each year. This third movement in the data occurs 
regularly every year. For example, department store sales fall to a 
low at the beginning and middle of each year, and reach a peak dur¬ 
ing Easter, and a much higher peak at the end of the year. Such 
variations, which occur regularly during the period of a year, are 
known as seasonal fluctuations. These movements may result from 
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natural causes such as climatic conditions. Thus, ice cream sales in¬ 
crease during the summer, as does building construction. However, 
natural causes do not account for all seasonal fluctuations—some 
causes are man-made. These include (1) the custom of giving gifts 
at Christmas, Mother’s Day, and Father’s Day; (2) the change of 
style in clothing in the spring or fall; (3) the desire to purchase new 
cars in the spring; (4) the change in the date of the automobile show 
to counteract this desire; and so on. 

Thus, Figure 14.4 portrays a monthly seasonal movement. How¬ 
ever, you will observe that if we were to plot annual data, the 
monthly seasonal fluctuations would be obliterated. Seasonal move¬ 
ments also can be quarterly, weekly, daily, or hourly. Thus, if we 
had weekly data of department store sales, we might discover a reg¬ 
ular pattern of fluctuation during the month. Such movements are 
obscured, however, when monthly data are used. If daily data were 
available, regular fluctuations within a w T eek could be observed. 
Thus, commercial banks transactions reach a peak on the day when 
firms in the area pay their employees. If hourly data were available, 
then the movements within the course of a day could be observed. 
For example, such hourly seasonal movements are important to 
restaurants. 

Finally, in each of these charts there are certain movements which 
can be classified as random movements. That is, they result from a 
host of chance forces. 

Time series analysis concerns itself with identification of the sys¬ 
tematic elements. On the basis of past trends, past seasonal varia¬ 
tions, and past cyclical movements, attempts are made to forecast 
the future course of events. The existence of serial correlation, how¬ 
ever, makes it difficult to identify the systematic movements. Thus, 
a random event might result in a wide deviation from the population 
average. Because of serial correlation, the succeeding values would 
move back slowly to the population mean. Thus, if the set of ob¬ 
servations were analyzed soon after the occurrence of a random 
event, the random movement might be misinterpreted as an upward 
trend. The same effect might also create the illusion of a cyclical 
movement when observed at a later period. By that time, the series 
will have returned to the population average or even have dropped 
below it. We thus would observe as a cyclical peak, a movement in 
the data that was a combination of a random movement and serial 
correlation of the deviation of the series from its population average. 
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In analyzing a time series for its systematic and random compo¬ 
nents, the assumption is made that the four elements represent the 
result of different factors. The isolation of the three systematic ele¬ 
ments will, therefore, throw some light on the nature of the factors 
affecting them. This knowledge is then applied to project the effect 
of the systematic elements into the future. After allowances for ran¬ 
dom effects a forecast can be made on the basis of judgment. 

Thus, if Y t is used to symbolize the actual observation at time 
t, we shall express Y t in terms of the series’ trend at time t (T t ), the 
seasonal influence at time t (S t ), the cyclical influence (<?,), and 
random factors (R t ). One relationship often assumed is that 

Y t = TtXStXCtXRt. 

This equation states that the actual value of a variable at any time, 
Y t , is the product of the four elements considered. It implies that 
the magnitudes of T, S, C, and R are interrelated. This is not the 
only relationship we could assume, though it is the one most gener¬ 
ally used. For example, an alternative relationship is 

Y t = T t + S t + C t + R t . 

This implies that the magnitudes of the four elements are additive. 
In other words, the magnitude of one element would not be affected 
by the magnitude of any other. 

As noted, time series analysis deals with the isolation of the T, 8, 
and C factors from the original data. In subsequent sections, then, 
we shall discuss some of the techniques that may be used to isolate 
these elements. 

14.5 The Analysis of the Trend Element 

As we have stated, trend represents the resultant of the long-time, 
slowly moving forces that affect the data. In order to determine the 
nature of the trend, a first step is to plot the series on a chart. The 
chart, as we have stated previously, then serves a purpose similar to 
the scatter diagram in a regression problem. In fact, a trend relation¬ 
ship may be looked upon as a regression relationship with time as 
the independent variable. 

Before attempting any analysis of time series data, it is essential 
to see that the data are comparable for the entire period. Data 
gathered at different times are often based on units and terms dif¬ 
ferently defined, or are observations from different populations. In 
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such instances the data must be adjusted. Otherwise, differences in 
definition or coverage might be interpreted as evidence of systematic 
movements. 

Example 14.5 Consider the series on the number of clerical and 
kindred workers employed in the United States during the census 
years from 1900 to 1950. There have been changes in the definition 
of the term “employed” from census to census. There also have been 
changes in the coverage of the “clerical and kindred workers” clas¬ 
sification. Thus, for example, the census of 1950 excluded accountants 
and auditors from this classification, while prior censuses included 
them. It is necessary, therefore, to adjust the data before subjecting 
it to time series analysis. 

It is not always possible to make such adjustments. However, 
when the changes in definition or coverage are not large enough to 
affect the analysis of the series, we may use the unadjusted data. 
When the changes are more significant, it is necessary to break the 
data down into time periods of uniform coverage and definitions 
and then analyze each grouping independently. 

A Statistical Test for Trend . Even though it seems that there is an 
upward or downward drift in the data, we must remember that such 
movements could represent the fluctuations of a random process. 
An upward or downward movement could occur even though the 
process had no systematic element and actually was behaving in a 
completely random manner. Therefore, before a trend is fitted, we 
should decide whether, on the basis of the evidence at hand, the 
series represents a sample from a completely random process or re¬ 
flects a systematic movement. 

You should recognize that we are faced with a statistical decision 
problem where two alternate courses of action are possible: 

1. No trend should be fitted since the series represents a random proc¬ 
ess. 

2. A trend should be fitted since the series indicates the presence of 
systematic forces. 

Consider the differences between successive observations in a time 
series. The number of differences always will be one less than the 
number of observations. If many samples were drawn from a ran¬ 
dom process, then the expected number of positive differences be¬ 
tween successive observations would be equal to the expected num¬ 
ber of negative differences. If we call the expected number of dif¬ 
ferences having a positive sign, E(s), we have 

ft — 1 
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where n is the number of observations. Statisticians have developed 
a formula for the standard error of s under the assumption of a com¬ 
pletely random process. The formula is 

/n+7 

\ 12 

For large samples the sampling distribution of the number of posi¬ 
tive differences in a sample is approximated by the normal curve. 
The normal deviate is 


, ^ - m 

<T S ’ 

where 8 is the observed number of positive differences in a sample 
of n observations. 

Since n usually is given in a time series problem, in order to de¬ 
termine a decision rule we need set only the «-risk (i.e., the prob¬ 
ability of fitting a trend to the data when actually no trend exists). 
Let's look at an application. 


Example 14.6 Data for the Federal Reserve Board Index of In¬ 
dustrial Production from 1919 to 1957 are given in Table 14.1. These 
data already have been presented in Figure 14.3 (c). The chart in 
this figure seems to indicate a systematic upward movement. We now 
would like to apply the test discussed above to determine whether or 
not the appearance of an upward movement could have resulted from 
a completely random process. 


Table 74.7 

INDEX OF INDUSTRIAL PRODUCTION FOR THE UNITED STATES, 1919-57 


Year 


1919 

1920 

1921 

1922 

1923 

1924 

1925 

1926 

1927 

1928 

1929 

1930 

1931 


(1947-49 Average = 100) 


Index 

Sign of 
First 

Year 

Index 

Sign of 


Index 

Num¬ 

ber 

Differ¬ 

ence 

Num¬ 

ber 

Differ¬ 

ence 

Year 

Num¬ 

ber 

39 


1932.... 

31 

_ 

1945.. . . 

107 

41 

+ 

1933.... 

37 

+ 

1946.. .. 

90 

31 

— 

1934.. . . 

40 

+ 

1947. . .. 

100 

39 

+ 

1935. . .. 

47 

+ 

1948. . .. 

104 

47 

+ 

1936.. . . 

56 

+ 

1949.... 

97 

44 

— 

1937. . . . 

61 

+ 

1950.. . . 

112 

49 

+ 

1938.. . . 

48 

— 

1951. . . . 

120 

5L 

+ 

1939. . . . 

58 

+ 

1952. . . . 

124 

51 

* 

1940.... 

67 

+ 

1953.... 

134 

53 

+ 

1941.... 

87 

+ 

1954.... 

125 

59 

+ 

1942. . . . 

106 

+ 

1955. . . . 

139 

49 

— 

1943.... 

127 

+ 

1956. . . . 

143 

40 

— 

1944. . . . 

125 

— 

1957.... 

143 


Sign of 
First 
Differ¬ 
ence 


+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 


* There was no change between 1926 and 1927 nor between 1956 and 1957. These are counted as one half 
of a positive difference. 


























E(s) = 19 


The appropriate decision rule is thus: If the sample of 39 observa¬ 
tions results in a normal deviate of less than 1.96, no trend need be 
fitted. If the normal deviate is 1.96 or greater, a trend should be 
fitted. 

If we count the number of positive differences shown in Table 
14.1, we find $ = 25. Hence, 

s - E(s) 
z =- 

CT S 

25 - 19 


= 3.3. 

Since z is greater than 1.96, we conclude that a trend should be fitted. 

A few words of caution concerning this test are in order. The test 
considers only the direction of movement from time period to time 
period and not the magnitude of such movements. Thus, the magni¬ 
tude of the upward movements might be large, while the downward, 
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although equal in number, might be very small. The presence of 
trend in such instances would not be revealed by this test. Similarly, 
this test will indicate the presence of a trend where no trend exists if 
many small upward movements are canceled out by a few sharp 
downward movements. However, in such instances the chart of the 
data should reveal whether situations like those just described exist. 
We then would know that the use of this test for the presence of 
trend would be misleading. Figure 14.5 illustrates time series for 


Figure 14.5 

ILLUSTRATIONS OF HYPOTHETICAL TIME SERIES WHERE FIRST 
DIFFERENCE TEST FOR TREND DOES NOT APPLY 



which: (a) the above test would indicate a random process even 
though a trend were present, and ( b ) the test would indicate the 
existence of trend even though the process were completely random. 

Fitting the Trend. Once the presence of a systematic trend is 
indicated by a statistical test or on the basis of judgment, we must 
examine the data to develop a theory concerning the trend. Is the 
trend represented by a straight line or a curve? If a curve, what is 
the exact nature of the curve? To get a better idea of the consider¬ 
ations involved in deciding upon the nature of the trend, the follow¬ 
ing problem is instructive. 
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Example 14,7 Figure 14.6 illustrates the growth of the number of 
clerical and kindred workers employed in the United States from 
1900 to 1950, by 10-year periods. The data are adjusted for differ¬ 
ences in definition and coverage. 

Figure 14.6 

CLERICAL AND KINDRED WORKERS EMPLOYED IN THE UNITED STATES 
FOR CENSUS YEARS 1900-1950 



Observe that two possible trend lines are drawn to illustrate the 
long-term growth of the data. The curve weaves in and out of the ac¬ 
tual observations. These deviations could be looked upon as random 
effects. Hence, if we decide upon the curve rather than the straight 
line, our rationale is that the data represent trend and random ele¬ 
ments. 

However, if we decide upon the straight line trend, we imply that 
forces other than those giving rise to trend and random elements are 
operative. The fact that the point for 1940 is far below the straight 
line trend would be attributed to the presence of cyclical forces af¬ 
fecting the data. The census results for 1940 would be taken to reflect 
the impact of the decline in business activity during the 1930’s. Al¬ 
though the number of clerical workers increased during this period, 
our choice would imply that the increase was retarded because of the 
depressed conditions of that decade. 

The brief discussion in the above example should enable you to 
see the subjective nature of methods for determining the trend. The 
choice of an appropriate trend line must be made on the basis of 
knowledge of the data involved and on the basis of economic theory. 
It is not always a simple matter to determine the nature of a trend. 
In fact, two experts might devise different, though logical, theories 
as to the nature of the trend of a given time series. 

Once the form of a trend line is decided upon, it can be fitted to 
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the data by either freehand methods or by mathematically derived 
formulas. The advantage in mathematically fitted trends is that such 
trends can be extrapolated more easily. A freehand curve can be 
projected in many ways. It often is claimed that mathematical meth¬ 
ods have the advantage that all persons fitting the trend by this 
method will get the same results. This claim is of doubtful validity. 
In the final analysis a subjective decision on the adequacy of a trend 
must be made on the degree to which the fitted curve satisfies the 
concept of trend. It, therefore, would appear to be much simpler to 
draw freehand the curve that seems best to meet our theory of trend. 
There is no advantage in the fact that all who fit a mathematical 
curve will get the same results if they all get the same incorrect re¬ 
sults. Furthermore, in fitting a mathematical curve there always is 
the danger of ascribing to the curve a rigor and precision far beyond 
the subjective analysis upon which its choice is based. This would 
be similar to measuring the dimensions of a dining-room table to the 
nearest thousandth of an inch in order to purchase a table cloth. 

However, if a mathematical curve is to be used, we can fit a least- 
squares straight-line trend with the same formulas as those applied 
in fitting a regression line. In time series analysis the X-variable 
represents time. Hence, our straight-line equation is 

T t - a + bX , 

where T t is the trend of the time series. Let us see how we may esti¬ 
mate the trend line for a given series. 

Example 14.8 . The straight-line trend in Figure 14.6 is drawn in 
freehand. The fitting of a least-squares straight line to the same data 
is illustrated below. Note that time is the X-variable. X-values have 
been assigned to each year as 0, 1, . . . , for convenience. Hence, we 
must remember that X = 0 in 1900 and that for each 10 years, X in¬ 
creases one unit. 


COMPUTATION OF LEAST-SQUARES TREND LINE FOR EMPLOYED 
CLERICAL AND KINDRED WORKERS, 1900-1950 




Workers 

— 

-— k : ■, 

Year 


(Millions) 




X 

Y 

XY 

X 2 

1900. 

0 

0.8 

0 

0 

1910. 

1 

1.9 

1.9 

1 

1920. 

2 

2.8 

5.6 

4 

1930. 

3 

4.1 

12.3 

9 

1940. 

4 

4.4 

17.6 

16 

1950. 

5 

6.9 

34.5 

25 

Total. 

15 

20.9 

71.9 

55 
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The values of b and a are calculated by the least-squares formulas 
which were presented in the preceding chapter. 

nXXY - SXSF 
b ~ nSX 2 - (2X) 2 
6(71.9) - 15(20.9) 

6(55) - (15) 2 
431.4 - 313.5 
330 - 225 
= 1.12 

a = Y -bX 


Hence, 


209 

6 

= 0.68 



T t = 0.68 + 1.12X. 
(Origin: 1900, X = 10 years.) 


Whenever the equation of a trend line is given, the origin —the year 
at which X is zero—and the unit of time in which X is expressed must 
be indicated. 


In fitting any type of trend line to data marked by cyclical influ¬ 
ences, care must be taken if the time series starts at one phase of 
the cycle and ends at another phase. Thus, a trend fitted to a series 
that starts at a low point in the cycle and ends at a high point will 
have an exaggerated upward slope. Similarly, a downward slope 
will be exaggerated, or an upward slope understated, if we start at a 
peak period and end at a low period. If we were to fit a trend line 
for the series on clerical and kindred workers from 1940 to 1950, we 
would have too steep a trend, unless we took into consideration the 
fact that the year 1940 represented a cyclical low point. Similarly, 
the slope of a line would not be steep enough if we were to fit the 
trend to the period 1910-40 and again neglected the fact that 1940 
represents a low point in the cycle. This again emphasizes the im¬ 
portance of a thorough knowledge of the subject matter in fitting 
trend lines. 

Similar methods are available for fitting curvilinear trends. How¬ 
ever, we shall not deal with those methods in this text. 

Projection of Trends . Trend lines may be fitted for historical 
reasons, i.e., to indicate the growth of a series in the past. However, 
their main purpose in business problems is projection. Thus, an 
electric power company that provides services to residents of a cer¬ 
tain area must project the demand for power at least 5 years into 
the future. It takes that long to expand the capacity of such a utility. 
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Similarly, an oil company might project the demand for oil 25 years 
into the future to determine the sufficiency of available resources 
and to plan for future resource development. In addition, a study is 
now under way to determine the characteristics of the New York 
Metropolitan Region in 1975. Such projections are needed as a basis 
tor intelligent regional planning and development. 

It takes only a second to project a trend with ruler and pencil 
even for 50 years into the future. Such projections, however, rest on 
assumptions about the future that must be recognized. 

Example 14.9 It would be a simple matter to project a freehand 
trend Ime, or a least-squares line, to predict the number of employed 
clencal and kindred workers for any future year. In so doing how¬ 
ever, we would be assuming that the impact of the trend factors in- 
uencmg the growth in the past will have the same combined effect in 
the future. Among the more important of these forces are: population 
growth growth m the labor force, average hours worked, Jrodtmtiv- 
lty (automation, etc.), changes m relative importance of service in¬ 
dustries, changes m size of businesses, changes in business loca- 

Let us discuss briefly some of the matters that would have to be 
considered m order to determine whether or not the trend forces 
would act m the future as in the past. 

The rate of population growth which had been decreasing from 
ecade to decade since 1870, had increased between 1940 and 1950. 
This, by itself, would tend to make for an increased rate of growth in 

ofVooah^n 6 ' Th ° trend ’i ne> . however > was determined on the basis 
f 900 1950 experience. Therefore, a simple extension of a trend line 
might underestimate the impact of the accelerated growth in popula¬ 
tion and labor force even though the proportion of the labor force go- 
mg mto clerical operations remained constant. 

If productivity should increase in clerical professions due to the in¬ 
creased use of electronic equipment and other business machines 
fewer people might be employed in clerical occupations. But if the 
aveiage hours of work per week should decrease in the future this 
would m part counteract the results of increased productivity 'Fur¬ 
thermore, electronic equipment is still relatively expensive and 
hence its installation is economically feasible for only large firms 
There has been a tendency, however, for industry to concentrate in 
industrial parks. This has given rise to firms whose function is to 
provide clerical services for these industries with the aid of elec¬ 
tronic equipment. Should this procedure become more and more ac 
cepted it would allow small firms to avail themselves of increased 
productivity due to electronic equipment and make for a decrease in 
the number of clerical workers needed. 

Service industries are becoming relatively more important in our 
economy. Generally, clerical workers comprise a large proportion of 
total employment in these industries. Therefore, if this tendency con- 
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tinues, it would tend to make for relatively greater employment of 

clerical workers. . M , ,,, i ■, 

We could continue in this vein, discussing other factors that would 
make for more rapid growth, slower growth, constant growth, or a 
decline in the number of employed clerical workers m the future. Ex¬ 
pert i udgment must be used to evaluate the net effect of such develop- 
ments in determining the necessity for adjustment m any trend pro- 
jection. 

By carefully spelling out the assumptions upon which a forecast 
is made, a method of analysis for future events is provided. As t e 
strength or weakness of the initial assumptions becomes apparent, . 
adjustments can be made in the prediction. Good forecasting does 
not imply merely a precise estimate. Such precision often may be 
due to fortuitous circumstances. Good forecasting implies a system 
of continuous analysis-a system that will disclose why initial fore¬ 
casts were in error or why they were correct. Only m this way can 
we gain confidence in a forecasting procedure. 

It is unlikely that one can forsee the future with a high degree of 
accuracy. A general rule for sound forecasting is, therefore: revise 
any forecast continuously. Forecasting is not a one-shot operation. 
To be effective, it requires continuous attention. Thus, if the de¬ 
mand for a company’s product for 1965 is estimated m 1958, a good 
rule would be to revise the estimate in 1959, 1960, and so on. Unan¬ 
ticipated developments will often change our picture of the future, 
or at least clarify it. In terms of any original decisions and actions 
that have been taken, this rule implies continuous modification, 
where possible. Decisions about the future cannot be made and then 
not reviewed. Such decisions must be made sequentially and con¬ 
tinuously. 

In conclusion, remember that in projecting trends we are con¬ 
cerned with only one of the systematic elements that economists 
attribute to time series. Therefore, at times, the possible impact of 
the other elements—seasonal, cyclical, and random—must be con- 
sidered. 

The Moving Average. At times it is difficult to determine the 
nature of the trend because the trend movements are obscured by 
the effect of random and cyclical movements. A useful device for 
smoothing out the random fluctuations and the cyclical movements 
is a moving average. The method of computing a moving average is 
best explained with an illustration. 

Example 1410 Below a 3-item moving average is determined for 
the number of ordinary life insurance policies in force in the United 
States from 1950-57. 
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T° take a 3-item moving average, we total the first 3 items (64 + 
. + 70 - 201). The total is placed opposite the middle item—-201 

T e then dro 5 the first item (64) and add the 

2im d* I? 7 ? to ,, get our second movm g total (67 + 70 + 73 = 
tn°J VT)! *® n drop the seoond item (67) and add the fifth item (76) 
t0 .f, t thlrd moving total (70 + 73 + 76 = 219), and so on To 
get the 3-fiem moving average we divide each moving total by 3 


Year 

Number of 
Policies 
(in Millions) 

3-1 tem 
Moving 
Total 

3-Item 

Moving 

Average 

1950. 

p.a 

_— 


1951. 

f\ 7 

201 

210 

219 

229 

239 

250 


1952. 

70 

67.0 

1953. 

4 U 

70 

70.0 

1954. 

i o 

7fi 

73.0 

1955. 

i D 
on 

76.3 

1956. 

oU 

CO 

79.7 

1957. 

oo 

557 

83.3 

—---- 

o i 




—vc wne i we raKe a moving average, we lose observations 

items 6 the'^Hn^ ld 60,3 ° f & ThUS ’ th ° 0riginal series had 8 

items, the moving average series has only 6 items. This fact is » 

must not mS ] ° f + Se ° f . the movin g average for forecasting. We 
must not only project a series of moving averages into the future but 
we must first project the series into the present. 

Now that we know how to determine a moving average, let us 

ofXta 6 ° W a m ° V nS aV6rage ° an Sm °° th fluctuations in a series 

^Example 14.11 A hypothetical time series is plotted in Figure 


Figure 14.7 

HYPOTHETICAL TIME SERIES AND FOUR-YEAR MOVING AVERAGE 
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You will observe that the given data are periodic m nature. The 
number of years from peak to peak (year 3 to year 7 and year 7 to 
year 11) and from trough to trough (year 1 to year 5, year 5 to year 
9 and year 9 to year 13) always is 4 years. Let us see what happens 
to the data if we take a moving average equal in length to the period 
of fluctuation—4 periods. The 4-item moving average for the data is 
computed below: 


WORK SHEET FOR COMPUTING FOUR-ITEM MOVING AVERAGE 


(1) 

Year 

(2; 

Value 

(3) 

4-Item 

Moving 

Total 

(4) 

4-Item 

Moving 

Average 

(5) 

2-Item 

Moving 

Total 

(6) 

2-Item 

Moving 

Average 


30 





2 . 

40 


40 



3 . 

50 

160 

80 

40 

4 . 

40 

160 

40 

80 

40 

5 . 

30 

160 

40 

80 

40 

6 . 

40 

160 

40 

80 

40 

7 . 

50 

160 

40 

80 

40 

8 . 

40 

160 

40 

80 

40 

9 . 

30 

160 

40 

80 

40 

10 . 

40 

160 

40 

80 

40 

11 . 

50 

160 

40 

80 

40 

12 . 

40 

160 

40 



13. 

30 






Note that for a 4-item moving average the middle of each group is 
between the second and third items of that group. Whenever an 
even-item moving average is taken, each moving average value tails 
between 2 items. It is desirable at times, therefore, to center the mov¬ 
ing average. This can be done by taking a 2-item moving average of 
the moving average series already computed. The procedure is dem¬ 
onstrated in columns 5 and 6 above. The first value of the final mov¬ 
ing average series is centered at period 3 and the last value is at pe- 

rinrl ll- 


Ill the previous illustration, the 4-item moving average com- 
pletely smooths out the ups and downs in the series which has a re¬ 
occurring cycle of 4 years. However, random movements and cycli- 
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cal movements are, as a rule, not completely periodic in nature 
Therefore, most time series will not be smoothed perfectly by a mov¬ 
ing average—some irregularities will remain. In general, an n-item 
moving average will have a smoothing effect on fluctuations with 
average periodicities of n time periods and less than n time periods 
Thus, a 12-item moving average will smooth out fluctuations of 12 
or fewer time periods in duration. 

The moving average device is helpful in revealing the trend by 
smoothing out cyclical and random fluctuations. However, it is also 
possible for a moving average to introduce cyclical fluctuations into 
a series of data that are random in nature. This tendency is known 
as the Slutzky-Yule effect, named after the two men who studied 
this characteristic of moving averages. The possibility of this effect 
calls for great care on the part of the analyst who uses moving aver- 
ages to smooth data. 

Example 14.12 This illustration has been designed to show how 
a moving average can introduce the appearance of a cyclical fluctua- 
tion m time senes. Let us assume that the following hypothetical 
data are given and that a 5-year moving average is computed. 


Year 

Value 

5-Year 
Moving 
Total 

5-Year 
Moving 
Average 

1. 

100 



2. 

100 



3. 

100 

500 

100 

4. 

100 

420 

84 

5. 

100 

420 

84 

6. 

20 

420 

84 

7. 

100 

420 

84 

8. . . .. 

100 

420 

84 

9. 

100 

500 

100 

10. 

100 

500 

100 

11. 

100 



12. 

100 




Figure 14.8 portrays the original time series and the moving aver- 

age. 

We thus observe a series which is constant except for the occur¬ 
rence of a random event. However, the 5-year moving average has 
converted this into a series which appears to have a cycle with a flat¬ 
tened trough between years 4 and 8. 

It is not unusual to find such a situation in a time series Thus the 
series might represent the volume of production of a given industry. 
Production is fairly constant for 5 years. During the sixth year a 
prolonged strike (random event) occurs and production drops off 
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sharply. The strike is settled, and then production reverts to its ear¬ 
lier level. A moving average, as we have seen, would make this ran¬ 
dom event appear as a cyclical fluctuation. 

Figure 7 4.8 

HYPOTHETICAL TIME SERIES AND A FIVE-YEAR MOVING AVERAGE 



14.6 The Analysis of Seasonal Movements 

Time series data fluctuate from month to month during any 
given year. For example, the demand for automobiles varies from 
month to month, as does the level of automobile production. When 
the pattern of variability remains fairly constant from year to year, 
we say that there is a systematic seasonal factor influencing the data. 
Thus, department store sales are high before Christmas and Easter 
and low during the summer months. The production of ladies’ coats 
and suits is at a high level during the summer months and during 
February and March but is at a lower level during the rest of the 

year. . . 

A knowledge of the seasonal pattern is an important factor m busi¬ 
ness planning. The knowledge that flower sales will be high in May 
because of Mother’s Day tells the florist that he must stock flowers 
to meet the demand. The knowledge that production will be high 
during the summer months helps the garment manufacturer to plan 
his working capital requirements, recruitment of labor, machine 
maintenance needs, and so on. 

Knowledge of seasonal movements also helps management to 
evaluate past and current performance. Thus, in a given year, sales 
for the month of March might be 10 per cent above February sales 
but less than the 20 per cent increase that should be expected on a 
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seasonal basis. An alert firm should investigate the reasons for such 
occurrences and, if they find it necessary, should make the appropri- 
ate policy changes. 

A business, knowing the character of a seasonal pattern, has a basis 
for deciding, if possible, whether or not to change it. Thus, one soap 
manufacturing concern was able to smooth out the ups and downs 
m its pattern of production by changing its method of distribution. 
A cold cereal manufacturer, observing that cereal sales fall off dur¬ 
ing winter months, might attempt to raise the level of sales during 

that period by offering special prizes to purchasers during the winter 
months. 

At other times, it is desirable to measure seasonal fluctuations 
simply to eliminate them from the original data. This often enables 

us to study the other systematic movements (trend and cycle) more 
easily. 

. f Van r T th0 , dS haVe been develo P ed to measure seasonal var¬ 
iation. The logic behind all of these methods is to eliminate the in¬ 
fluence of trend, cyclical, and random elements from the original 
time series data so that only the influence of the seasonal element 
remains. We shall describe only the mtio-to-moving-average method 
lor isolating seasonality, since it is most widely applicable. 

The basic steps m the “ratio-to-moving-average” method to de¬ 
rive a monthly seasonal are as follows: 

i. F ir st, take a 12-month moving average of the data. As you will 
recall a 12-month moving average smooths out fluctuations of 12 
months or less duration-these will include seasonal fluctuations, 
hus, our first step in isolating seasonal movements is to eliminate 

em from the data. Again assume that an observation at time t is 
the product of four elements 


Yt = T, X C t X S, X R t . 


The purpose of the moving average is 
movements from the data. The moving 
proximately equal to 


to eliminate the seasonal 
average, therefore, is ap- 


M A A . 

2. The second step is to take the ratio of the value of each month 
to the moving average for that month. The approximate effect of 
this step can be stated symbolically as follows: 


__ ■* t _ 

Moving Avebage 


TiXC t XS t XR t c 
T, X C, xli't ~ ■ 
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Thus, this step gives an approximation' of the seasonal movement 

U1 a^The ratio-to-moving-average terms for all Januaries, Febru¬ 
aries Decembers are then averaged (and adjusted) to de¬ 

termine aset of 12 seasonal indices. The effect of the averaging step 
is to remove random, cyclical, or trend elements that are not re¬ 
moved in the earlier steps. _ 

In the example that follows we apply this method to estimate a 
monthly seasonal index for the number of initial claims for unem¬ 
ployment insurance made on state programs. Unemployment data 
are one of the key indicators of shifts in general business activity. 
However, to reach decisions on the basis of unemployment data, it 
is necessary to adjust the data for seasonal fluctuations. Before fur¬ 
ther discussion, let us compute seasonal indices. 

Example 14.13 Table 14.2 presents a worksheet for determining 
the seasonal movement in initial unemployment insurance claims 
made on state programs by the ratio-to-moving-average method. In 
Sis illustration! data for only 5 years, 1953 to 1957, are used to com¬ 
pute seasonal indices. This is, as a rule, too short a penod , the 
ratio-to-moving-average method relies on averaging to remove non- 
seasonal effects. The longer the series, the greater the likelihood of 
averaging out nonseasonal effects. Generally, 8 to 15 years has been 

found to be a satisfactory period. . 

The first step in the ratio-to-moving-average method, the comp 
tation of the 12-month moving average, is performed m columns 
(b) and (c) of Table 14.2. You will notice that we have centered the 
moving average opposite the seventh month (July) instead of be¬ 
tween the sixth and seventh months. In this way we avoid the neces¬ 
sity of taking a 2-item moving average to center our onginal mov¬ 
ing averages on a month instead of between months. In principle, it 
may be preferable to follow the more detailed procedure. However, 
such refinements result in negligible improvements and 
not worth the added cost. The moving average is presented with the 
original data in Figure 14.9 on page 476. ( . , 

The second step, the computation of the ratios of ^e ac tua^data 
to the moving average, is performed in column (d). This yields a 
approximation of the seasonal fluctuation for each month. 

P The third step, the averaging of the ratios for each month is 
performed in Table 14.3. In averaging the ratios we shoidd try to 
avoid the distortion of the average by extreme values. This may 
be done by computing the median, which is unaffected by extreme 

"""An alternate method is to take a medial average, generally the 
mean of the middle three or five ratios. A medml average, the mean 
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Table 14.2 


w ?» R ,«^r ET r .f OR DETERMINING MONTHLY RATIO TO MOVING 
AVERAGE FOR INITIAL UNEMPLOYMENT INSURANCE CLAIMS 
_° N st ATE PROGRAMS FOR THE YEARS 1953-57 


Year and 
Month 


1953: 

January. . . . 
February. . . 

March. 

April. 

May. 

June. 

July. 

August. 

September. . , 

October. 

November. . . 
December.... 

1954: 

January. 

February. . . . 

March. 

April. 

May. 

June. 

July. 

August. 

September. . . 

October. 

November. . . 
December.... 

1955: 

January. 

February. . . . 

March. 

April. 

May. 

June. 

July. 

August. 

September. . . . 

October. 

November. . . . 
December. 

1956: 

January. 

February. 


(a) 

(b) 

(c) 

-. -‘t . . 

id) 

Weekly 


Average of 




Number of 

12-Month 

12-Month 

Ratio to 

Initial 

Moving 

Moving 

Moving 

Claims 

Total 

Average 

Average 

(in Thou- 


(a 4- c ) X 100 

sands) 




236 




184 




179 




190 




186 




182 




213 

189 

186 

209 

296 

2,601 

2,781 

2,932 

3,056 

3,194 

217 

232 

244 

251 

266 

98.2 

81.5 

76.2 

83.2 

111.3 

351 

3,300 

275 

127.6 

416 

3,407 

284 

146.7 

335 

3,497 

291 

115.1 

303 

3,571 

298 

111.7 

328 

3,640 

! 303 

108,3 

292 

3,693 

313 

93.3 

289 

303 

3,668 

3,632 

306 

303 

94.9 

100.0 

263 

3,572 

298 

88.3 

255 

3,494 

291 

87.6 

262 

3,407 

284 

92.3 

271 

3,317 

276 

98.2 

315 

3,230 

269 

113.4 

356 

3,143 

262 

135.9 

257 

3,068 

256 

100.0 

216 

238 

205 

202 

228 

2,994 

2,902 

2,827 

2,767 

2,722 

250 

242 

236 

231 

227 

86.4 

98.3 

87.7 

87.4 

100.0 

189 

2,673 

223 

84.8 

163 

187 

211 

270 

2,666 

2,663 

2,659 

2,670 

222 

222 

222 

223 

73.4 

84.2 

95.0 

121.1 

307 

2,673 

223 

137,7 

250 

2,699 

225 

111.1 
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Year and 

Month 

(a) 

Weekly 
Average of 
Number of 
Initial 
Claims 
(in Thou¬ 
sands) 

(b) 

12-Month 

Moving 

Total 

(c) 

12-Month 

Moving 

Average 

(d) 

Ratio to 
Moving 
Average 
(arc)X 100 


213 

2,692 

2,719 

224 

95.1 


234 

227 

226 

227 

luo.l 


216 

2,713 

yo.o 


205 

2,723 

yu.o 

T n l v . 

254 

2,740 

229 

liu.y 


182 

2.779 

2.780 

231 

78.8 

81.9 

78.0 


190 

232 

C'1/vfrihp’P . 

181 

2,781 

232 

NnvAtrihpr .. 

221 

2,797 

233 

94.8 

125.7 

. * 

293 

2,799 

233 

1957: 

340 

2,814 

235 

144.7 

"PWvriiRrv . 

251 

2,836 

236 

106.4 

90.3 

1 AO O 

IVTarpVi . 

214 

2,845 

237 


250 

2,901 

242 

1 KJOeO 
q * 7 A 


218 

2,979 

248 

o/.y 

IVJL&J .... * . 

220 

3,078 

257 

85.6 

1 AO i? 

Tnlv . 

276 

3,230 

269 

lUAb 

A ii oust, . 

191 





246 




OwyCUiuci ..* • 

259 




NmrAiYihpr .. 

320 




December. 

445 





Source: Economic Report of the Present, January, 1956, p. 187, and January, 1958, p. 139. 

of the items left after discarding the highest and the lowest ratios, is 

taken of the ratios in Table 14.3. ,• 

A last step requires the adjustment of the averages of the ratios 
(crude seasonal) so that they average 100 per cent each—that is,, so 
+W thpv total 1 200. The crude seasonal indices total only 

X i. .Uttipw by 

resulting values are the seasonal indices which sum to 1,200.0. iable 
14.3 also indicates this step. 

The seasonal index for January, as computed in the last example, 
is 137.6. What does this mean? The index tells us that on the basis 
of the 1953-57 data, the weekly number of unemployment insurance 
claims for January averaged 37.6 per cent above the average for the 
year The index of 84.6 for September indicates that claims tor that 
month averaged 15.4 per cent below the average for the year. 

























co 
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Figure 7 4.9 

INITIAL UNEMPLOYMENT INSURANCE CLAIMS ON STATE PROGRAMS, 

1953-57 

(Weekly Average in Thousands) 



To adjust data for seasonal influence, actual data observations are 
divided by the seasonal index. Thus, symbolically 


Seasonally Adjusted Data = 


S t 

TtXStXCtX Rt 
St 


= TtXCtXRt' 


Data which have been adjusted for seasonal variation do not show 
what the behavior of the series would have been in the absence of 
seasonal forces. The seasonal forces make up part of a complex of 
factors influencing the series. How the series would have behaved if 
all other forces but the seasonal were present is a question no one 
can answer. The deseasonalized data represent an average value. 
For example, if the remainder of the year would have operated on 
the same level as the month of January, then the average number 
of initial claims for the year 1957 would have been 

January, 1957, Average Weekly Claims 340 

Seasonal Index 137.6 

= 247 Thousand Claims . 
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. Example 14.14 During the recession of 1957-58 the President and 
his economic advisors studied unemployment figures carefully as an 
indicator of the state of the economy in order to reach decisions on 
anti-recession measures. In addition to the unemployment figures re¬ 
lating to workers covered by state unemployment insurance pro¬ 
grams, there is the monthly estimate of unemployment based on a 
sample survey, conducted by the Department of Commerce. For Feb¬ 
ruary, 1958, this survey reported 5,173,000 unemployed. The March 
figure was 5,198,000 unemployed—indicating an actual increase of 
only 25,000. However, to properly assess the change, the two figures 
must be adjusted for seasonal variation. When this is done by divid¬ 
ing each figure by the appropriate seasonal index, the increase in un¬ 
employment on a seasonally adjusted basis is more than 200,000. The 
calculations, based on Department of Commerce seasonal indices for 
February and March, 1958, are as follows: 


Month 

Actual Number 
of Unemployed 
(in Thousands) 

Seasonal 

Index 

Seasonally Ad¬ 
justed Number 
of Unemployed 
(in Thousands) 

February. 

5,173 

115.9 

4,463 

March. 

5,198 

111.3 

4,670 

Increase. 

25 


207 


The seasonal index considered in Example 14.13 is called a con¬ 
stant seasonal. That is, the seasonal index for each month is assumed 
to be the same from year to year. This very often is not a valid as¬ 
sumption since seasonal indices sometimes exhibit trends and cycli¬ 
cal movements. Such indices are called moving seasonals . The 
method we have described, with certain additional steps, can be 
applied to determine a moving seasonal, although we shall not cover 
the technique in this text. 

The index we have computed also can be described as a relative 
seasonal since it is expressed as a percentage. A relative seasonal 
assumes that the seasonal effect is proportional to the level of the 
series. Thus, at higher levels the amplitude of seasonal fluctuations 
will be greater. The seasonal index for February, in Example 14.13, 
was 109.6. This implies that the actual values are, on the average, 
9.6 per cent above C t X T t X Rt . At times, the seasonal fluctuations 
are not proportional to the level of the series but remain approxi¬ 
mately the same in absolute amount no matter what the level. A 
seasonal index which describes such movements is called an abso¬ 
lute seasonal. The method for deriving this type of seasonal index 
differs only slightly from the method described above. 
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Seasonal indices, therefore, can be either constant or moving. In 
either event they may be stated in relative or absolute terms. As you 
can well imagine, the computation of a seasonal index is a laborious 
and costly process. However, during the past few years, electronic 
computers have been used to compute seasonal indices. The pro¬ 
cedure the machine simulates is the ratio-to-moving-average 
method. It can be used to compute relative or absolute seasonals, 
and either constant or moving seasonals. The machine not only com¬ 
putes the seasonal index but also adjusts the original series. 

14.7 Cyclical Fluctuations 

The classical method for isolating cyclical fluctuations is called 
the residual method. The trend and seasonal indices are determined 
and then eliminated. This leaves the cyclical and random effects. A 
moving average, usually of from 3 to 5 items, then is used to smooth 
out the random influences. The resulting values are considered meas¬ 
ures of cyclical fluctuations. 

There are serious objections to this method. These were stated 
most effectively by Arthur F. Burns and Wesley C. Mitchell, two 
authorities in the empirical study of business cycles: 1 

We do not follow that plan. In the first place, the isolation of cyclical 
fluctuations is a highly uncertain operation. Edwin Frickey once dili¬ 
gently assembled 23 trend lines fitted by various investigators to pig 
iron production in the United States, and found that some of the tiend 
lines yield cycles averaging 3 or 4 years in duration while others yield 
cycles more than ten times as long. This range of results illustrates viv¬ 
idly the uncertainty that attaches to separations of trends and cycles, 
though it perhaps exaggerates the difficulties. If an investigator fits a 
trend line in a mechanical manner, without specifying in advance his 
conception of the secular trend or of cyclical fluctuations, he may get 
“cycles” of almost any duration. But an informed investigator who is 
seriously studying cycles of a given order of duration will use whatever 
guidance he can get from history and statistics; he will scrutinize the 
movements of the original data, seek to mark off in advance the cycles 
or traces of cycles that correspond to his basic conception, then choose a 
trend line that cuts through and exposes the cycles in which his interest 
centers. Yet this procedure also illustrates the difficulties of segregating 
trends and cycles. For it leaves room for choice of the trend line, the 
method of fit, and the method of trend elimination. Further, it makes 
the trend depend upon the cycles, and may not lead to the discovery of 
eycles that are obscured by the trend. To judge what features of the 

” i Arthur F. Bums and Wesley C. Mitchell, Measuring Business Cycles (New 
York: National Bureau of Economic Research, 1946), pp. 37-38. Quoted by permis¬ 
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final result are merely technical and what features are a significant 
characteristic of the series is likely to require considerable testing and 
experimenting even on the part of a skilled technician. 

It is fairly common for statisticians to assume that the elimination of 
the secular trend from a time series indicates what the course of the 
series would have been in the absence of secular movements, and that 
the graduation of a time series, whether in original or trend-adjusted 
form, indicates what the course of the series would have been in the ab¬ 
sence of random movements. There is no warrant for such simple inter¬ 
pretations. A “least squares” trend line fitted, for example, to grocery 
chain store sales in the United States may move majestically on a chart, 
but the analytic significance of the trend line is obscure. At least some of 
the “growth factors” impinging on this branch ofbusiness—the addition 
of meats and vegetables to the grocery line, the rise of supermarkets, 
special taxes on chain stores—have made their influence felt spasmodi¬ 
cally. When a continuous “trend factor” is eliminated from the data, it is 
therefore difficult to say what influences impinging on the activity have 
been removed and what influences have been left in the series. Cyclical 
graduations are no easier to interpret than trend adjustments. Systematic 
smoothing of a time series will, indeed, eliminate short-run oscillations 
produced by random factors; but can it eliminate the influence of power¬ 
ful random factors—such as a protracted strike, or a succession of bad 
harvests, or a great war? 

There is always danger that the statistical operations performed on 
the original data may lead an investigator to bury real problems and 
worry about false ones. ... 

Mitchell, during his lifetime, was associated with the National Bu¬ 
reau of Economic Research. Burns still is associated with the NBER, 
This organization, since its creation in the middle 1920’s, has been a 
guiding force in the conduct of empirical research relating to the 
business cycle. The Bureau has analyzed the cyclical behavior of 
over 1,000 economic time series, American and foreign. It prefers to 
use monthly or quarterly data to study cycles since annual data of¬ 
ten conceal the timing of cyclical peaks or troughs. Data are ad¬ 
justed for seasonal fluctuation before the study of cyclical move¬ 
ments begins. This facilitates the study of the cyclical movements 
which often are obscured by seasonal fluctuations. 

The Bureau relates its study of cycles in each series to a table of 
reference dates, showing the months and years of troughs and peaks 
in general business activity. For the United States, the tables are 
based upon business annals and then refined, tested, and amended 
on the basis of subsequent research. The peaks and troughs in the 
cycles for each series are examined for leads and lags relative to 
the reference cycles. The Bureau at present is experimenting 
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with the information it has gathered relative to leads and lags to 
predict fluctuations in general business activity. 

The National Bureau, in its empirical studies, has deviated from 
the traditional approach to the analysis of systematic movements in 
time series. Other investigators have set up methods of analysis that 
also differ from the one we have defined. A discussion of these al¬ 
ternative methods is beyond the scope of this book. We make refer¬ 
ence to them only to stress that the traditional residual method has 
several alternatives. 

While the various approaches to time series analysis have been 
useful in describing the historical behavior of time series, none of 
these has yielded reliable methods of predicting future cyclical be¬ 
havior. In most projections on the basis of time series it is necessary 
to use judgment reinforced with various types of economic data to 
predict the nature of cyclical fluctuations. Thus, the best (and per¬ 
haps the only) way to forecast cyclical shifts in business activity is 
through a thorough understanding of existing economic conditions 
and an economic analysis of what they might imply for the imme¬ 
diate future. 

14.8 Trend, Seasonal, and Cyclical Elements—a Synthesis 

We have reviewed the methods used to isolate the systematic ele¬ 
ments in a time series. How do we combine these elements for fore¬ 
casting purposes? 

In long-range forecasting we are concerned mainly with trend. 
If we need to predict demand for electric power 20 years hence, it 
is sufficient to predict the trend of annual demand. On the basis of 
this trend forecast a utility company, for example, could decide on 
the extent of an expansion program and make plans for its long- 
range financing. It would make little sense to attempt to predict 
the seasonal pattern of demand at that time since such data, even if 
they could be predicted, would be of little value in decision making 
today. 

At times, it might be desirable to predict the cyclical influence in 
a long-range forecast. However, such predictions still are beyond our 
fondest hope. Even short-run cyclical effects cannot be predicted 
consistently. Therefore, as a practical expedient one should regu¬ 
larly check a long-range forecast of trend. As the period for which 
the forecast was made approaches, the revised trend forecast can be 
modified on the basis of a qualitative analysis of the cyclical influ¬ 
ence. 


Ch. 14] 


FORECASTING WITH TIME SERIES 


481 


In conclusion, let us now look at the following illustration of a 
short-run forecast. 


. Exam P l e 14.15 A company desires to forecast sales for the com¬ 
ing year. It can use the forecast as a basis for decisions on the 
budget for the year, production schedules, sales quotas, advertising 
plans, procurement policy, and labor needs. In the short-run forecast 
however, it is necessary to know the seasonal pattern of sales during 
the year, m addition to forecasting annual sales. The Eastman Ko¬ 
dak Company prepares such short-run forecasts for its products. 2 Its 
procedure is as follows: 

1. As a first step, the company extrapolates the past trend of 
sales. This gives an estimate of trend. 


f At “ , next necessary to modify the trend forecast for cyclical 
and possible random effects, and for possible changes in past trend 
forces m the future. The modifying factors considered are as follows* 
a) The assumed pattern of general business conditions. 

Any changes in the competitive position of the company. 

The current dealer inventory situation. 

Sales plans and policies. 

Advertising and promotional plans. 

Product research and development. 

Price changes anticipated. 

Capacity. 

Availability of labor and materials. 

. Government controls. 

The analysis of the influence of these factors on past trends is the 
basis for the adjustment. 


b) 

c) 

d ) 

e) 
/) 
9) 

h) 

i) 


tt then is necessary to allocate the annual forecast among the 
months of the year on the basis of the seasonal pattern. Let us as¬ 
sume that the annual sales forecast for a product is $1,800 000 The 
average monthly sales will, therefore, be $160,000. If the seasonal in¬ 
dex for January is .80, then the forecast of January sales is: 


$150,000 X .80 = $120,000 . 

Other monthly sales estimates are derived by multiplying $150000 
by the appropriate seasonal indices. 


This illustration again should emphasize that forecasting on the 
basis of time series analysis leans heavily on expert judgment. The 
projection of the trend is a relatively minor step in the annual sales 
forecast. The subsequent adjustment of this projection is the key 
factor m determining the actual forecast. Unfortunately, predictions 
of this type are not wholly within the province of statistical theory. 
The risk of an error in the forecasting procedure cannot be meas- 


See Company Organization for Economic Forecasting, American Management 
Association, Research Report No. 28 (New York, 1957), p 51 . management 



482 STATISTICS FOR BUSINESS DECISIONS [Ch. 14 

ured by probability theory, and, hence, such risks cannot be con- 
trolled. 

14.9 You Should Now Know That 

Forecasting involves the methods and theory of making decisions 
for the future on the basis of a study of present populations. 

A time series consists of a set of observations made at different 
periods of time. 

At the present state of our knowledge, any forecast of the future 
behavior of a time series must rest more heavily on judgment than 
on statistical theory. 

A time series may be graphically presented on arithmetic scale 
charts or on semilogarithmic charts. 

The trend of a time series is the resultant of the long-term eco¬ 
nomic forces. 

Cycles are oscillatory movements, of more than one year in dura¬ 
tion, which result from ups and downs in general business activity. 

Seasonal fluctuations are variations which occur regularly during 
the period of a year and are the result of climatic conditions and cus¬ 
toms. 

Time series are also subject to random fluctuations. 

Time series analysis deals with the isolation of systematic ele¬ 
ments—i.e., trend, cyclical, and seasonal movements. 

On the basis of the factors identified, an attempt is made to fore¬ 
cast the future course of events. 

A time series may be tested for trend on the basis of observed 
signs of differences between successive terms. 

The trend of a time series may be indicated by a moving average. 

A trend line may be fitted to a time series as a freehand line or as 
a mathematical equation. 

Seasonal fluctuations may be estimated by the ratio-to-moving- 
average method. 

Seasonal indices may be constant or moving and also may be rela¬ 
tive or absolute. 

Cyclical fluctuations are best forecasted on the basis of sound eco¬ 
nomic knowledge of the current state of the economy. 

Every forecast involves assumptions; a good forecaster recognizes 
these assumptions explicitly. 

Effective forecasting requires that any projection should be re¬ 
vised continuously and, where possible, requires continuous modifi¬ 
cation of original decisions. 


Ch. 14] FORECASTING WITH TIME SERIES 483 

1410 You Should Now Be Able to Solve 

1. For each of the following problems indicate what systematic 
component(s)—trend, seasonal, cyclical—of the time series must 
be measured to provide a basis for action. 

a) An investor wants to decide, on the basis of past price and earn¬ 
ings movements, whether or not to buy XYZ stock as a long-term 
commitment. 

h) A speculator wants to decide, on the basis of past price and earnings 
movements, whether or not to buy ABC stock for the possibility of 
a quick profit. 

c) An estimate of the demand for steel in 1975 is required as a basis 
for the decision on how much of an increase in capacity should be 
planned. 

d) A company must prepare its monthly production quotas for the 
coming year. 

e) A state wants to estimate the yield of a gasoline tax in the coming 
fiscal year for budgetary purposes. 

2. Explain and discuss the following statement: Forecasting the 
future course of economic events is not wholly a statistical problem, 

3. The following time series data are the average price per com¬ 
mon share for Minnesota Mining and Manufacturing Company 
from 1946 to 1958: 


Year 

Average Price* 
(in Dollars) 

Year 

Average Price* 
(in Dollars) 

1946...... 

6.0 

1953 


1947. 

7.4 

1954 

QA Q 

1948...... 

8.3 

1955 

AO & 

1949. 

10.4 

1956 

TCO.O 

A4 1 

1950. 

15.3 

1957 

7Q ^ 

1951. 

22.8 

1958 

94.8 

1952. 

21.8 



* Average of high and low prices during calendar year; adjusted for stock splits through 1958. 


Plot this time series on arithmetic scale graph paper and on semi- 
logarithmic scale graph paper. If you were to fit a freehand trend 
line to the data as a basis for prediction, which chart would facilitate 
your task more? Explain your answer. 

What type of information would you need before you drew and 
projected a trend line? 

4. The following time series represents a firm's ratios of operating 
income to net sales for 1937-58. Formulate a statistical decision 
rule in terms of a statistic measuring the number of positive dif- 
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ferences between successive observations to provide a basis for the 
decision on whether or not to fit a trend to the series. Apply the 
rule to the data. What is the decision? 


Year 

Ratio 

(in Per Cent) 

Year 

Ratio 

(in Per Cent) 

1Q27 

10.1 

1948. 

10.7 

1938 . ... 

7.9 

1949. 

7.6 

1939 . 

11.3 

1950. 

10.6 

1Q40 

10.8 

1951. 

9.8 

1941 . . • 

9.2 

1952. 

i 8.9 

1942 

5.3 

1953. 

9.3 

1943 

5.9 

1954. 

10.3 

1944 

7.0 

1955. 

9.9 

1945 

6.8 

1956. 

9.9 

1946 . 

2.2 

1957. 

8.7 

1Q47 . . 

11.1 

1958. 

6.8 





5. The following time series lists the net sales of a large food 
company from 1947 through 1957 inclusive: 


Year 

Net Sales 
(Millions of 
Dollars) 

Year 

Net Sales 
(Millions of 
Dollars) 

1947 

112.9 

1953. 

151.0 

1948 . ... 

119.8 

1954. 

146.8 

1Q4Q 

90.0 

1955. 

162.4 

1 QKO 

98.6 

1956. 

166.8 

1 QK1 

125.1 

1957. 

166.1 

1952. 

117.0 




a) Plot this time series on arithmetic scale graph paper. 

b) Fit a least-squares straight-line trend line to the data and draw the 
trend line on the chart constructed in part (a). 

c) Discuss the adequacy of the line as a basis for predicting sales in 
1958; as a basis for predicting sales in 1968. Consider alternatives 
to simply extending the line. 

d) Based upon your discussion in part (c), prepare estimates of sales 
in 1958 and in 1968 for use by the controller in deciding on new 
financing. List all pertinent assumptions made in preparing these 
forecasts. 

e) Critically and constructively analyze the use of time series analysis 
in this type of problem. 

6 . To consider shifts in the level of sales due to factors other 
than the seasonal element, a sales executive might take the ratio of 
sales in January, 1959, to those in January, 1958, the ratio of sales in 
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February, 1959, to those in February, 1958, and so forth. Will these 
ratios be influenced by seasonal factors in any way? Explain your 

answer. J 

. 7 ‘ The f°ll° w ing data represent the sales of a product by a plas¬ 
tics manufacturer, by months, from 1952 through 1958 inclusive: 


MONTHLY SALES 
(Thousands of Dollars) 



1952 

1953 

1954 

1955 

1956 

1957 

1958 

January . 

14.3 

13.9 

14.7 

13.7 
14.0 

1 £ K 


16.1 

17.4 
17.0 

17.8 
17.1 
17.0 

16.8 

15.8 

16.4 
17.7 

18.9 

17.9 
17.3 

17.9 

17.1 

18.3 

February.. 

1^9 

liJ^Z 

March. 

1 p; z 

it).Z 

J6.U 

16.9 

16.1 

17.8 

April. 

1 r; q 

10.o 

18.4 

19.2 

May. 

Id.O 

1/10 

10.1 

17.4 

17.9 

June. 

1 A Q 

14.5 

IQ e 

15.6 

16.2 

18.0 

July. 

AO.Zr 

12.0 

13.3 

14.4 

15.3 

14.3 
14.2 

X A • O 

19 1 

io.b 

i/ii 

15.2 

14.8 

if n 

16.4 

17.5 

August. 

1Z. 1 

14 1 

14.1 

IQ t 

16.3 

16.9 

September . 

XTC, A 

15.2 
1C 9 

10.4 

mo 

Ida) 

16.9 

16.8 

18.5 

October. . . 

10. u 
me 

18.4 

20.1 

18.2 

19.2 

November . 

December . 

lU.Zf 

15.1 

10. 0 
16.2 

1 e Q 

17.7 

17.0 

20.8 

20.1 


lO.U 

lO.o 

10.0 

18.0 

18.8 


a) Plot this time series on arithmetic scale graph paper. Qualitatively 
describe the seasonal pattern. 

b) Compute monthly seasonal indices for these data. Do the results 
bear out your description of the seasonal pattern? 

c) Deseasonalize the data and plot the deseasonalized data on the 
chart prepared in part (a). 

d) 1! y ? U ^ required t0 find th e trend of this series, would you fit 
the trend line to the actual data or the deseasonalized data? Ex¬ 
plain your answer. 

e) What assumptions would you make if you were to apply these in¬ 
dices after 1958? 

/) The company’s sales quota for the product in 1959 is 1225,000. Es¬ 
timate the monthly sales quotas on the basis of the seasonal indices 
you have computed. 


14.11 You Will Also Find That 

The following references are among those which consider some 
of the techniques that may be useful in time series analysis: 

Ceoxton, Frederick E., and Cowden, Dudley J. Applied General Sta¬ 
tistics, pp. 240-392. 2d ed. New York: Prentice-Hall, Inc., 1955. 

Mandell, B. J. Statistics for Management, pp. 277-301. Baltimore 
Maryland: Dangary Publishing Co., 1956. 

Mills, Frederick C. Statistical Methods, pp. 319-425. 3d ed New 
York: Henry Holt & Co., Inc., 1955. 
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Sprowls, R. Clay. Elementary Statistics , pp. 262-331. New York. 
McGraw-Hill Book Co., Inc., 1955. 

Some of the basic economic factors which must be considered in 
any forecasting problem are covered in the following texts: 

Bassie, Y. Lewis. Economic Forecasting. New York: McGraw-Hill Book 
Co., Inc., 1958. 

Bratt, Elmer Clark. Business Cycles and Forecasting . 4th ed. Home- 
wood, Ill.: Richard D. Irwin & Co., Inc., 1953. 
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APPENDIX • A 


Table of Squares, Square Roots, 

and Reciprocals 
N=1 to 1,000* 


The table given on the following ten pages will be found useful 
for numerical calculations. It includes squares, square roots, and 
reciprocals for all the integers from 1 through 1,000. 

The table is particularly useful to determine square roots. For any 
given integer, N, the table gives value of 

VN and VlON 
Thus, for N = 184 the table indicates that 
VN = VI84 = 13.56466 

and that 

VlON = V1840 - 42.89522 

The square roots of other numbers besides those of the integers given 
may be easily determined if relationships such as the following are 
kept in mind: 

vToO N = 10V¥ 

VlOOOiV = ioVIW 

* From Charles D. Hodgman (ed.), C.R.C. Standard Mathematical Tables (11th 
ed.; Cleveland: Chemical Rubber Publishing Company, 1957), by permission. 
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For example, note that 

^.0184 = ■ \j l(mo (184) 

= I5o VIli 

The table indicates that V184 = 13.56466 and hence 

VWS4 = ~ (13.56466) 

= .1356466 

Observe, if you use the table of reciprocals, that zeros immediately 
after the decimal point of the reciprocal usually are written, not in 
the body of the table, but rather in the column caption. For exam¬ 
ple, verify that the reciprocal of 184 is .005434783 (since the caption 
of the column 1/N lists .00 and that should precede all values in 
the column). 
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SQUARES, SQUARE ROOTS, AND RECIPROCALS: 1-100 


N 

iV* 

Vn 

VlO N 

l/N 

1 

1 

1,000 000 

3.162 278 

L.0000000 

2 

4 

1.414 214 

4.472 136 

.5000000 

3 

9 

1.732 051 

5.477 226 

.3333333 

4 

16 

2.000 000 

6.324 555 

.2500000 

5 

25 

2.236 068 

7.071 068 

.2000000 

6 

36 

2.449 490 

7.745 967 

.1666667 

7 

49 

2.645 751 

8.366 600 

. 1428571 

8 

64 

2.828 427 

8.944 272 

.1250000 

9 

81 

3.000 000 

9.486 833 

.1111111 

10 

100 

3.162 278 

10.00000 

.1000000 

11 

121 

3.316 625 

10.48809 

.09090909 

12 

144 

3.464 102 

10.95445 

.08333333 

13 

169 

3.605 551 

11.40175 

.07692308 

14 

196 

3.741 657 

11.83216 

.07142857 

15 

225 

3.872 983 

12.24745 

.06666667 

16 

256 

4 000 000 

12.64911 

.06250000 

17 

289 

4.123 106 

13.03840 

.05882353 

18 

324 

4.242 641 

13.41641 

.05555556 

19 

361 

4.358 899 

13.78405 

.05263158 

20 

400 

4.472 136 

14.14214 

.05000000 

21 

441 

4.582 576 

14.49138 

.04761905 

22 

484 

4.690 416 

14.83240 

.04545455 

23 

529 

4.795 832 

15.16575 

.04347826 

24 

576 

4.898 979 

15.49193 

.04166667 

25 

625 

5.000 000 

15.81139 

.04000000 

26 

676 

5.099 020 

16.12452 

.03846154 

27 

729 

5.196 152 

16.43168 

.03703704 

28 

784 

5.291 503 

16.73320 

.03571429 

29 

841 

5.385 165 

17.02939 

.03448276 

30 

900 

5.477 226 

17.32051 

.03333333 

31 

961 

5.567 764 

17.60682 

.03225806 

32 

1 024 

5.656 854 

17.88854 

.03125000 

33 

1 089 

5.744 563 

18.16590 

.03030303 

34 

1 156 

5.830 952 

18.43909 

.02941176 

35 

1 225 

i 5.916 080 

18.70829 

.02857143 

36 

1 296 

i 6.000 000 

18.97367 

.02777778 

37 

1 369 

i 6.082 763 

; 19.23538 

.02702703 

38 

1 444 

> 6.164 414 

: 19.49359 

.02631579 

39 

1 521 

6.244 90S 

! 19.74842 

.02564103 

40 

1 60C 

► 6.324 55£ 

>20.00000 

,02500000 

41 

1 681 

L 6.403 124 

l 20.24846 

.02439024 

42 

1 764 

l 6.480 741 

. 20.49390 

.02380952 

43 

1 841 

> 6.557 43£ 

> 20.73644 

.02325581 

44 

1 93( 

1 6.633 25( 

) 20.97618 

.02272727 

45 

2 02/ 

5 6.708 2G4 

121.21320 

.02222222 

46 

2 IK 

3 6,782 33( 

) 21.44761 

.02173913 

47 

2 20S 

} 6.855 65/ 

5 21.67948 

.02127660 

48 

2 30- 

1 6.928 20/ 

S 21.90890 

.02083333 

49 

2 40: 

l 7.000 00< 

3 22.13594 

.02040816 

50 

2 501 

D 7.071 06! 

S 22.36068 

.02000000 


N 

N * 

Vn 

VlOJV’ 

1/JST 

.0 

50 

2 500 

7.071 068 

22.36068 

2000000 

51 

2 601 

7.141 428 

22.58318 

1960784 

52 

2 704 

7.211 103 

22.80351 

1923077 

53 

2 809 

7.280 110 

23.02173 

1886792 

54 

2 916 

7.348 469 

23.23790 

1851852 

55 

3 025 

7.416 198 

23.45208 

1818182 

56 

3 136 

7.483 315 

23.66432 

1785714 

57 

3 249 

7.549 834 

23.87467 

1754386 

58 

3 364 

7.615 773 

24.08319 

1724138 

59 

3 481 

7.681 146 

24.28992 

1694915 

60 

3 600 

7.745 967 

24.49490 

1666667 

61 

3 721 

7.810 250 

24.69818 

1639344 

62 

3 844 

7.874 008 

24.89980 

1612903 

63 

3 969 

7.937 254 

25.09980 

1587302 

64 

4 096 

8.000 000 

25.29822 

1562500 

65 

4 225 

8.062 258 

25.49510 

1538462 

66 

4 356 

8.124 038 

25.69047 

1515152 

67 

4 489 

8.185 353 

25.88436 

1492537 

68 

4 624 

8.246 211 

26.07681 

1470588 

69 

4 761 

8.306 624 

26.26785 

1449276 

70 

4 900 

8.366 600 

26.45761 

1428571 

71 

5 041 

8.426 150 

26.64583 

1408451 

72 

5 184 

8.485 281 

26.83282 

1388889 

73 

5 329 

8.544 004 

27.01851 

1369863 

74 

5 476 

8.602 325 

27.20294 

1361351 

75 

5 625 

8.660 254 

27.38613 

1333333 

76 

5 776 

8.717 798 

27.56810 

1315789 

77 

6 929 

8.774 964 

27.74887 

1298701 

78 

6 084 

8.831 761 

27.92848 

1282061 

79 

6 241 

8.888 194 

28.10694 

1266823 

80 

6 400 

8.944 272 

28.28427 

1260000 

81 

6 561 

9.000 000 

28.46050 

1234568 

82 

6 724 

9.055 385 

28.63564 

1219512 

83 

6 889 

9.110 434 

28.80972 

1204819 

84 

7 056 

9.165 151 

28.98275 

1190476 

85 

7 225 

9.219 544 

29.15476 

1176471 

86 

7 396 

9.273 618 

29.32576 

1162791 

87 

7 569 

9.327 379 

29.49576 

1149425 

88 

7 744 

: 9.380 832 

29.66479 

1136364 

89 

7 921 

9.433 981 

29.83287 

1123596 

90 

8 10C 

l 9.486 833 

! 30.00000 

1111111 

91 

8 281 

. 9.539 392 

! 30.16621 

1098901 

92 

8 464 

; 9.591 663 

1 30.33150 

1086957 

93 

8 64? 

i 9.643 651 

. 30.49590 

1075269 

94 

8 83t 

1 9.695 36C 

> 30.65942 

1063830 

95 

9 02/ 

5 9.746 794 

l 30.82207 

1052632 

96 

9 21( 

1 9.797 95£ 

) 30.98387 

1041667 

97 

9 401 

) 9.848 85/ 

i 31.14482 

1030928 

98 

9 60< 

L 9.899 49/ 

5 31.30495 

1020408 

99 

9 80: 

l 9.949 87< 

l 31.46427 

1010101 

100 

10 00< 

) 10.00000 

3162278 

1000000 
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SQUARES, SQUARE ROOTS, AND RECIPROCALS: 100-200 


N 

N * 

Vn 

Via N 

1/N 

.0 


N 

N * 

Vn 

View 

1/N 

.00 

100 

10 000 

10.00000 

31.62278 

10000000 


150 

22 500 

12.24745 

38.72983 

6666667 

101 

10 201 

10.04988 

31.78050 

09900990 


151 

22 801 

12.28821 

38.85872 

6622517 

102 

10 404 

10.09950 

31.93744 

09803922 


152 

23 104 

12.32883 

38.98718 

6578947 

103 

10 609 

30.14889 

32.09361 

09708738 


153 

23 409 

12.36932 

39.11521 

6535948 

104 

10 816 

10.19804 

32.24903 

09615385 


154 

23 716 

12,40967 

39.24283 

6493506 

105 

11 025 

10.24695 

32.40370 

09523810 


155 

24 025 

12.44990 

39.37004 

6451613 

106 

11 236 

10.29563 

32.55764 

09433962 


156 

24 336 

12,49000 

39.49684 

6410256 

107 

11 449 

10.34408 

32.71085 

09345794 


157 

24 649 

12.52996 

39.62323 

6369427 

108 

11 664 

10.39230 

32.86335 

09259259 


158 

24 964 

12.56981 

39.74921 

6329114 

109 

11 881 

10.44031 

33,01515 

09174312 


159 

25 281 

12.60952 

39.87480 

6289308 

110 

12 100 

10.48809 

33.16625 

09090909 


160 

25 600 

12.64911 

40.00000 

6250000 

111 

12 321 

10.53565 

33.31666 

09009009 


161 

25 921 

12.68858 

40.12481 

6211180 

112 

12 544 

10.58301 

33.46640 

08928571 


162 

26 244 

12.72792 

40.24922 

6172840 

113 

12 769 

10.63015 

33.61547 

08849558 


163 

26 569 

12.76715 

40!37326 

6134969 

114 

12 996 

10.67708 

33.76389 

08771930 


164 

26 896 

12.80625 

40.49691 

6097561 

115 

13 225 

10.72381 

33.91165 

08695652 


165 

27 225 

12.84523 

40.62019 

6060606 

116 

13 456 

10.77033 

34.05877 

08620690 


166 

27 556 

12.88410 

40.74310 

6024096 

117 

13 689 

10.81665 

34.20526 

08547009 


167 

27 889 

12.92285 

40.86563 

5988024 

118 

13 924 

10.86278 

34.35113 

08474576 


168 

28 224 

12.96148 

40.98780 

5952381 

119 

14 161 

10.90871 

34.49638 

08403361 


169 

28 561 

13.00000 

41.10961 

5917160 

120 

14 400 

10.95445 

34.64102 

08333333 


170 

28 900 

13.03840 

41.23106 

5882353 

121 

14 641 

11.00000 

34.78505 

08264463 


171 

29 241 

13.07670 

41.35216 

5847953 

122 

14 884 

11.04536 

34.92850 

08196721 


172 

29 584 

13.11488 

41.47288 

5813953 

123 

15 129 

11.09054 

35.07136 

08130081 


1/3 

29 929 

13.16295 

41.59327 

5780347 

124 

15 376 

11.13553 

35.21363 

08064516 


174 

30 276 

13.19091 

41.71331 

5747126 

125 

15 625 

11.18034 

35.35534 

08000000 


176 

30 625 

13.22876 

41.83300 

5714286 

126 

16 876 

11.22497 

35.49648 

07936508 


176 

30 976 

13.26650 

41.06235 

5681818 

127 

16 129 

11.26943 

35.63706 

07874016 


177 

31 329 

13.30413 

42.07137 

5649718 

128 

16 384 

11.31371 

35.77709 

07812500 


178 

31 684 

13.34166 

42.19005 

5617978 

129 

16 641 

11.35782 

35.91657 

07751938 


179 

32 041 

13.37909 

42.30839 

5586592 

130 

16 900 

11.40175 

36.05551 

07692308 


180 

32 400 

13.41641 

42.42641 

5655556 

131 

17 161 

11.44552 

36.19392 

07633588 


181 

32 761 

13.45362 

42.54409 

5524862 

132 

17 424 

11.48913 

36.33180 

07575758 


182 

33 124 

13.49074 

42.66146 

5494505 

133 

17 689 

11.53256 

36.46917 

07518797 


183 

33 489 

13.52775 

42.77850 

5464481 

134 

17 956 

11.67584 

36.60601 

07462687 


184 

33 856 

13.66466 

42:89522 

5434783 

135 

18 225 

11.61895 

36.74235 

07407407 


185 

34 225 

13.60147 

43.01163 

5405405 

136 

18 496 

11.66190 

36.87818 

07352941 


186 

34 596 

13.63818 

43.12772 

5376344 

137 

18 769 

11.70470 

37.01351 

07299270 


187 

34 969 

13.67479 

43.24350 

5347594 

138 

19 044 

11.74734 

37.14835 

07246377 


188 

35 344 

13.71131 

43.35897 

5319149 

139 

19 321 

11.78983 

37.28270 

07194245 


189 

35 721 

13.74773 

43.47413 

5291005 

140 

19 600 

11.83216 

37.41667 

07142857 


190 

36 100 

13.78405 

43.58899 

5263158 

141 

19 881 

11.87434 

37.54997 

07092199 


191 

36 481 

13.82027 

43.70355 

5235602 

142 

20 164 

11.91638 

37.68289 

07042254 


192 

36 864 

13.85641 

43.81780 

5208333 

143 

20 449 

11.95826 

37.81534 

06993007 


193 

37 249 

13.89244 

43.93177 

6181347 

144 

20 736 

12.00000 

37.94733 

06944444 


194 

37 636 

13.92839 

44.04543 

5154639 

145 

21 025 

12.04159 

38.07887 

06896552 


195 

38 025 

13.96424 

44.15880 

5128205 

146 

21 316 

12.08305 

38.20995 

06849315 


196 

38 416 

14.00000 

44.27189 

5102041 

147 

21 609 

12.12436 

38.34058 

06802721 


197 

38 809 

14.03567 

44.38468 

5076142 

148 

21 904 

12.16553 

38.47077 

06756757 


198 

39 204 

14.07125 

44.49719 

5050505 

149 

22 201 

12.20656 

38.60052 

06711409 


199 

39 601 

14.10674 

44.60942 

5025126 

150 

22 500 

12.24745 

38.72983 

06666667 


200 

40 000 

14.14214 

44.72136 

5000000 
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STATISTICS FOR BUSINESS DECISIONS 

SQUARES, SQUARE ROOTS, AND RECIPROCALS: 200-300 


N 


Vn 

VioN 

1/N 

.00 


N 

N* 

Vn 

VlON 

1/N 

.00 

200 

40 000 

14.14214 

44.72136 

5000000 


250 

62 500 

15.81139 

50.00000 

4000000 

201 

40 401 

14.17745 

44.83302 

4975124 


251 

63 001 

15.84298 

50.09990 

3984064 

202 

40 804 

14.21267 

44.94441 

4950495 


252 

63 504 

15.87451 

50.19960 

3968254 

203 

41 209 

14.24781 

45.05552 

4926108 


253 

64 009 

15.90597 

50.29911 

3952569 

204 

41 616 

14.28286 

45.16636 

4901961 


254 

64 516 

15.93738 

50.39841 

3937008 

205 

42 025 

14.31782 

45.27693 

4878049 


255 

65 025 

15.96872 

50.49752 

3921569 

206 

42 436 

14.35270 

45.38722 

4854369 


256 

65 536 

16.00000 

50.59644 

3906250 

207 

42 849 

14.38749 

45.49725 

4830918 


257 

66 049 

16.03122 

50.69517 

3891051 

208 

43 264 

14.42221 

45.60702 

4807692 


258 

66 564 

16.06238 

50.79370 

3875969 

209 

43 681 

14.45683 

45.71652 

4784689 


259 

67 081 

16.09348 

50.89204 

3861004 

210 

44 100 

14.49138 

45.82576 

4761905 


260 

67 600 

16.12452 

50.99020 

3846154 

211 

44 521 

14.52584 

45.93474 

4739336 


261 

68 121 

16.15549 

51.08816 

3831418 

212 

44 944 

14.56022 

46.04346 

4716981 


262 

68 644 

16.18641 

51.18594 

3816794 

213 

45 369 

14.59452 

46.15192 

4694836 


263 

69 169 

16.21727 

51.28353 

3802281 

214 

45 796 

14.62874 

46.26013 

4672897 


264 

69 696 

16.24808 

51.38093 

3787879 

215 

46 225 

14.66288 

46.36809 

4651163 


265 

70 225 

16.27882 

51.47815 

3773585 

216 

46 656 

14.69694 

46.47580 

4629630 


266 

70 756 

16.30951 

51.57519 

3759398 

217 

47 089 

14.73092 

46.58326 

4608295 


267 

71 289 

16.34013 

51.67204 

3745318 

218 

47 524 

14.76482 

46.69047 

4587156 


268 

71 824 

16.37071 

51.76872 

3731343 

219 

47 961 

14.79865 

46.79744 

4566210 


269 

72 361 

16.40122 

51.86521 

3717472 

220 

48 400 

14.83240 

46.90416 

4545455 


270 

72 900 

16.43168 

51.96152 

3703704 

221 

48 841 

14.86607 

47.01064 

4524887 


271 

73 441 

16.46208 

52.05766 

3690037 

222 

49 284 

14.89966 

47.11688 

4504505 


272 

73 984 

16.49242 

52.15362 

3676471 

223 

49 729 

14.93318 

47.22288 

4484305 


273 

74 529 

16.52271 

52.24940 

3663004 

224 

50 176 

14.96663 

47.32864 

4464286 


274 

75 076 

16.55295 

52.34501 

3649635 

225 

50 625 

15.00000 

47.43416 

4444444 


275 

i 75 625 

16.58312 

52.44044 

3636364 

226 

51 076 

15.03330 

47.53946 

4424779 


276 

76 176 

16.61325 

52.53570 

3623188 

227 

51 529 

15.06652 

47.64452 

4405286 


277 

76 729 

16.64332 

52.63079 

3610108 

228 

51 984 

15.09967 

47.74935 

4385965 


278 

77 284 

16.67333 

52.72571 

3597122 

229 

52 441 

15.13275 

47.85394 

4366812 


279 

77 841 

16.70329 

52.82045 

3584229 

230 

52 900 

15.16575 

47.95832 

4347826 


280 

78 400 

16.73320 

52.91503 

3571429 

231 

53 361 

15.19868 

48.06246 

4329004 


281 

78 961 

16.76305 

53.00943 

3558719 

232 

53 824 

15.23155 

48.16638 

4310345 


282 

79 524 

16.79286 

53.10367 

3546099 

233 

54 289 

15.26434 

48.27007 

4291845 


283 

80 089 

16.82260 

53.19774 

3533569 

234 

54 756 

15.29706 

48.37355 

4273504 


284 

80 656 

16.85230 | 

53.29165 

3521127 

235 

55 225 

15.32971 

48.47680 

4255319 


285 

81 225 

16.88194 

53.38539 

3508772 

236 

55 696 

15.36229 

48.57983 

4237288 


286 

81 796 

16.91153 

53.47897 

3496503 

237 

56 169 

15.39480 

48.68265 

4219409 


287 

82 369 

16.94107 

53.57238 

3484321 

238 

56 644 

15.42725 

48.78524 

4201681 


288 

82 944 

16.97056 

53.66563 

3472222 

239 

57 121 

15.45962 

48.88763 

4184100 


289 

83 521 

17.00000 

53.75872 

3460208 

240 

57 600 

15.49193 

48.98979 

4166667 


290 

84 100 

17.02939 

53.85165 

3448276 

241 

58 081 

15.52417 

49.09175 

4149378 


291 

84 681 

17.05872 

53.94442 

3436426 

242 

58 564 

15.55635 

49.19350 

4132231 


292 

85 264 

17.08801 

54.03702 

3424658 

243 

59 049 

15.58846 

49.29503 

4115226 


293 

85 849 

17.11724 

54.12947 j 

3412969 

244 

59 536 

15.62050 

49.39636 

4098361 


294 

86 436 

17.14643 

54.22177 1 

3401361 

245 

60 025 

15.65248 

49.49747 

4081633 


295 

87 025 

17.17556 

54.31390 ^ 

3389831 

246 

60 516 

15.68439 

49.59839 

4065041 


296 

87 616 

17.20465 

54.40588 

3378378 

247 

61 009 

15.71623 

49.69909 

4048583 


297 

88 209 

17.23369 

54.49771 

3367003 

248 

61 504 

15.74802 

49.79960 

4032258 


298 

88 804 

17.26268 

54.58938 

3355705 

249 

62 001 

15.77973 

49.89990 

4016064 


299 

89 401 

17.29162 

54.68089 

3344482 

250 

62 500 

15.81139 

60.00000 

4000000 


300 

90 000 

17.32051 

54.77226 

3333333 






APPENDIXES 


493 


SQUARES, SQUARE ROOTS, AND RECIPROCALS: 300-400 



N * 

Vn 

Viojst 

1/2V 

.00 

300 

90 

000 

17.32051 

54.77226 

3333333 

301 

90 

601 

17.34935 

54.86347 

3322259 

302 

91 

204 

17.37815 

54.95453 

3311258 

303 

91 

809 

17.40690 

55.04544 

3300330 

304 

92 

416 

17.43560 

55.13620 

3289474 

305 

93 

025 

17.46425 

55.22681 

3278689 

306 

93 

636 

17.49286 

55.31727 

3267974 

307 

94 

249 

17.52142 

55.40758 

3257329 

308 

94 

864 

17.54993 

55.49775 

3246753 

309 

95 

481 

17.57840 

55.58777 

3236246 

310 

96 

100 

17.60682 

55.67764 

3225806 

311 

96 

721 

17.63519 

55.76737 

3215434 

312 

97 

344 

17.66352 

55.85696 

3205128 

313 

97 

969 

17.69181 

55.94640 

3194888 

314 

98 

596 

17.72005 

56.03570 

3184713 

315 

99 

225 

17.74824 

56.12486 

3174603 

316 

99 

856 

17.77639 

56.21388 

3164557 

317 

100 

489 

17.80449 

56.30275 

3154574 

318 

101 

124 

17.83255 

56.39149 

3144654 

319 

101 

761 

17.86057 

56.48008 

3134796 

320 

102 

400 

17.88854 

56.56854 

3125000 

321 

103 

041 

17.91647 

56.65686 

3115265 

322 

103 

684 

17.94436 

56.74504 

3105590 

323 

104 

329 

17.97220 

56.83309 

3095975 

324 

104 

976 

18.00000 

56.92100 

3086420 

325 

105 

625 

18.02776 

57.00877 

3076923 

326 

106 

276 

18.05547 

57.09641 

3067485 

327 

106 

929 

18.08314 

57.18391 

3058104 

328 

107 

584 

18.11077 

57.27128 

3048780 

329 

108 

241 

18.13836 

57.35852 

3039514 

330 

108 

900 

18.16590 

57.44563 

3030303 

331 

109 

561 

18.19341 

57.53260 

3021148 

332 

110 

224 

18 22087 

57.61944 

3012048 

333 

110 

889 

18.24829 

57.70615 

3003003 ! 

334 

111 

556 

18.27567 

57.79273 

2994012 | 

335 

112 

225 

18.30301 

57.87918 

2985075 

336 

112 

896 

18.33030 

57.96551 

2976190 

337 

113 

569 

18.35756 

58.05170 

2967359 

338 

114 

244 

18.38478 

58.13777 

2958580 

339 

114 

921 

18.41195 

58.22371 

2949853 

340 

115 

600 

18.43909 

58.30952 

2941176 

341 

116 

281 

18.46619 

58.39521 

2932551 

342 

116 

964 

18.49324 

58.48077 

2923977 

343 

117 

649 

18.52026 

58.56620 

2915452 

344 

118 

336 

18.54724 

58.65151 

2906977 

345 

119 

025 

18.57418 

58.73670 

2898551 

346 

119 

716 

18.60108 

58.82176 

2890173 

347 

120 

409 

18.62794 

58.90671 

2881844 

348 

121 

104 

18.65476 

58.99152 

2873563 

349 

121 

801 

18.68154 

59.07622 

2865330 

350 

122 

500 

18.70829 

59.16080 

2857143 


350 122 500 18.70829 

351 123 201 18.73499 

352 123 904 18.76166 

353 124 609 18.78829 

354 125 316 18.81489 


355 126 025 

356 126 736 

357 127 449 

358 128 164 

359 128 881 

360 129 600 

361 130 321 

362 131 044 

363 131 769 

364 132 496 

365 133 225 

366 133 956 

367 134 689 

368 135 424 

369 136 161 

370 136 900 

371 137 641 

372 138 384 

373 139 129 

374 139 876 

375 140 625 

376 141 376 

377 142 129 

378 142 884 

379 143 641 

380 144 400 

381 145 161 

382 145 924 

383 146 689 

384 147 456 

385 148 225 

386 148 996 
387149 769 

388 150 544 

389 151 321 

390 152 100 

391 152 881 

392 153 664 

393 154 449 

394 155 236 

395 156 025 

396 156 816 

397 157 609 

398 158 404 

399 159 201 

400 160 000 


18.84144 

18.86796 

18.89444 

18.92089 

18.94730 

18.97367 

19.00000 

19.02630 

19.05256 

19.07878 

19.10497 

19.13113 

19.15724 

19.18333 

19.20937 

19.23538 

19.26136 

19.28730 

19.31321 

19.33908 

19.36492 

19.39072 

19.41649 

19.44222 

19.46792 

19.49359 

19.51922 

19.54483 

19.57039 

19.59592 

19.62142 

19.64688 

19.67232 

19.69772 

19.72308 

19.74842 

19,77372 

19.79899 

19.82423 

19.84943 

19.87461 

19.89975 

19.92486 

19.94994 

19.97498 


View 

1/N 

.00 

59.16080 

2857143 

59.24525 

2849003 

59.32959 

2840909 

59.41380 

2832861 

59.49790 

2824859 

59.58188 

2816901 

59.66574 

2808989 

59.74948 

2801120 

59.83310 

2793296 

59.91661 

2785515 

60.00000 

2777778 

60.08328 

2770083 

60.16644 

2762431 

60.24948 

2754821 

60.33241 

2747263 

60.41523 

2739726 

60.49793 

2732240 

60.58052 

2724796 

60.66300 

2717391 

60.74537 

2710027 

60.82763 

2702703 

60.90977 

2695418 

60.99180 

2688172 

61.07373 

2680965 

61.15554 

2673797 

61.23724 

2666667 

61.31884 

2659574 

61.40033 

2652520 

61.48170 

2645503 

61.56298 

2638522 

61.64414 

2631579 

61.72520 

2624672 

61.80615 

2617801 

61.88699 

2610966 

61.96773 

2604167 

62.04837 

2597403 

62.12890 

2590674 

62.20932 

2583979 

62.28965 

2577320 

62.36986 

2570694 

62.44998 

2564103 

62.52999 

2557545 

62.60990 

2551020 

62.68971 

2544529 

62.76942 

2538071 

62.84903 

2531646 

62.92853 

2525253 

63.00794 

2518892 

63.08724 

2512563 

63.16645 

2506266 

63.24555 

2500000 
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STATISTICS FOB BUSINESS DECISIONS 


SQUARES, SQUARE ROOTS, AND RECIPROCALS: 400-500 
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SQUARES, SQUARE ROOTS, AND RECIPROCALS: 500-600 


N 

N* 

Vn 

VTojv - 

1/N 

.00 


N 

N* 

Vn 

Vion 

1/N 

.00 


wpi 

22.36068 

70.71068 

2000000 


550 

302 500 

23.45208 

74.16198 

1818182 

501 


22.38303 

70.78135 

1996008 


551 

303 601 

23.47339 

74.22937 

1814882 

502 


22.40536 

70.85196 

1992032 


552 

304 704 

23.49468 

74.29670 

1811594 

503 

wwWiitfll 

22.42766 

70.92249 

1988072 


553 

305 809 

23.51595 

74.36397 

1808318 

504 

254 016 

22.44994 

70.99296 

1984127 


554 

306 916 

23.53720 

74.43118 

1805054 

505 

255 025 

22.47221 

71.06335 

1980198 


555 

308 025 

23.55844 

74.49832 

1801802 

506 

MrtiMlMil 

22.49444 

71.13368 

1976285 


556 

309 136 

23.57965 

74.56541 

1798561 

507 

257 049 

22.51666 

71.20393 

1972387 


557 

310 249 

23.60085 

74.63243 

1795332 

508 

258 064 

22.53886 

71.27412 

1968504 


558 

311 364 

23.62202 

74.69940 

1792115 

509 



71.34424 

1964637 


559 

312 481 

23.64318 

74.76630 

1788909 

510 


22.58318 

71.41428 

1960784 


560 

313 600 

23.66432 

74.83315 

1785714 

611 

261 121 

22.60531 

71.48426 

1956947 


561 

314 721 

23.68544 

74.89993 

1782531 

i3 

262 144 

22.62742 

71.55418 

1953125 


562 

315 844 

23.70654 

74.96666 

1779359 

[,] 


22.64950 

71.62402 

1949318 


563 

316 969 

23.72762 

75.03333 

1776199 

F M 

264 196 

22.67157 

71.69379 

1945525 


564 

318 096 

23.74868 

75,09993 

1773050 

515 

265 225 

22.69361 

71.76350 

1941748 


565 

319 225 

23.76973 

75.16648 

1769912 

516 

266 256 

22.71563 

71.83314 

1937984 


566 

320 356 

23.79075 

75.23297 

1766784 

517 

vmmHVto 

22.73763 

71.90271 

1934236 


567 

321 489 

23.81176 

75,29940 

1763668 

518 

268 324 

22.75961 

71.97222 

1930502 


568 

322 624 

23.83275 

•75.36577 

1760563 

519 

269 361 

22.78157 

72.04165 

1926782 


569 

323 761 

23.85372 

75.43209 

1757469 

520 


22.80351 

72.11103 

1923077 


570 

324 900 

23.87467 

75.49834 

1754386 

521 

271 441 

22.82542 

72.18033 

1919386 


571 

326 041 

23.89561 

75.56454 

1751313 

522 

272 484 

22.84732 

72.24957 

1915709 


572 

327 184 

23.91652 

75.63068 

1748252 

523 

273 529 


72.31874 

1912046 


573 

328 329 

23.93742 

75.69676 

1745201 

524 

274 576 

22.89105 

72.38784 

1908397 


574 

329 476 

23.95830 

75.76279 

1742160 

525 

275 625 

22.91288 

72.45688 



575 

330 625 

23.97916 

75.82875 

1739130 


276 676 

22.93469 

72.52586 

1901141 


576 

331 776 

24.00000 

75.89466 

1736111 

527 

277 729 

22.95648 

72.59477 

1897533 


577 

332 929 

24.02082 

75.96052 

1733102 

528 

278 784 

22.97825 

72.66361 



578 

334 084 

24.04163 

76.02631 

1730104 

529 

279 841 


72.73239 



579 

335 241 

24.06242 

76.09205 

1727116 

530 


23.02173 

72.80110 

1886792 


580 

336 400 

24.08319 

76.15773 

1724138 

531 

281 961 

23.04344 

72.86975 

1883239 


581 

337 561 

24.10394 

76,22336 

1721170 


283 024 

23.06513 

72.93833 

1879699 


582 

338 724 

24.12468 

76.28892 

1718213 

EaSl 

284 089 

23.08679 

73.00685 

1876173 


583 

339 889 

24.14539 

76.35444 

1715266 

534 


23.10844 

73.07530 

1872659 


584 

341 056 

24.16609 

76.41989 

1712329 

535 

286 225 

23.13007 

73.14369 

1869159 


585 

342 225 

24.18677 

76.48529 

1709402 

536 

287 296 

23.15167 

73.21202 

1865672 


586 

343 396 

24.20744 

76.55064 

1706485 

537 

288 369 

23.17326 

73.28028 

1862197 


587 

344 569 

24.22808 

76.61593 

1703578 

538 

289 444 

23.19483 

73.34848 

1858736 


588 

345 744 

24.24871 

76.68116 

1700680 

539 

290 521 

23.21637 

73.41662 

1855288 


589 

346 921 

24.26932 

76.74634 

1697793 

540 


23.23790 

73.48469 

1851852 


590 

348 100 

24.28992 

76.81146 

1694915 

541 

292 681 

23.25941 


1848429 


591 

349 281 

24.31049 

76.87652 

1692047 

542 

293 764 

23.28089 

73.62065 

1845018 


592 

350 464 

24.33105 

76.94154 

1689189 

543 

294 849 

23.30236 

73.68853 

1841621 


593 

351 649 

24.35159 

77.00649 

1686341 

544 

295 936 

23.32381 

73.75636 

1838235 


594 

352 836 

24.37212 

77.07140 

1683502 

545 


23.34524 

73.82412 

1834862 


595 

354 025 

24.39262 

77.13624 

1680672 


298 116 

23.36664 

73.89181 

1831502 


596 

355 216 

24,41311 

77.20104 

1677852 

547 

299 209 

23.38803 

73.95945 

1828154 


597 

356 409 

24.43358 

77.26578 

1675042 

548 


23.40940 


1824818 


598 

357 604 

24.45404 

77.33046 

1672241 

549 

301 401 

23.43075 

74.09453 

1821494 


599 

358 801 

24.47448 

77.39509 

1669449 


302 500 


74.16198 

1818182 


600 

360 000 

24.49490 

77.45967 

1666667 


[iahTw 















































































STATISTICS FOR BUSINESS DECISIONS 

SQUARES, SQUARE ROOTS, AND RECIPROCALS: 600-700 
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SQUARES, SQUARE ROOTS, AND RECIPROCALS: 700-800 


N 

n* 

Vn 

VlON 

1/N 

.00 

700 

490 000 

20.45751 

83.66600 

1428571 

701 

491 

401 

26.47640 

83.72574 

1426534 

702 

492 

804 

26.49528 

83.78544 

1424501 

703 

494 209 

26.51415 

83.84510 

1422475 

704 

495 616 

26.53300 

83.90471 

1420455 

705 

497 025 

26.55184 

83.96428 

1418440 

706 

498 436 

26.57066 

84.02381 

1416431 

707 

499 

849 

26.58947 

84.08329 

1414427 

708 

501 

264 

26.60827 

84.14274 

1412429 

709 

502 

681 

26.62705 

84.20214 

1410437 

710 

504 

100 

26.64583 

84.26150 

1408451 

711 

505 

521 

26.66458 

84.32082 

1406470 

712 

506 

944 

26.68333 

84.38009 

1404494 

713 

508 

369 

26.70200 

84.43933 

1402525 

714 

509 

796 

26.72078 

84.49852 

1400560 

715 

511 

225 

26.73948 

84.55767 

1398601 

716 

512 

656 

26.75818 

84.61678 

1396648 

717 

514 

089 

26.77686 

84.67585 

1394700 

718 

515 

524 

26.79552 

84.73488 

1392758 

719 

516 

961 

26.81418 

84.79387 

1390821 

720 

518 400 

26.83282 

84.85281 

1388889 

721 

519 

841 

26.85144 

84.91172 

1386963 

722 

521 

284 

26.87006 

84.97058 

1385042 

723 

522 729 

26.88866 

85.02941 

1383126 

724 

524 

176 

26.90725 

85.08819 

1381215 

725 

525 

625 

26.92582 

85.14693 

1379310 

726 

527 076 

26.94439 

85.20563 

1377410 

727 

528 529 

26.96294 

85.26429 

1375516 

728 

529 984 

26.98148 

85.32292 

1373626 

729 

531 

441 

27.00000 

85.38150 

1371742 

730 

532 

900 

27.01851 

85.44004 

1369863 

731 

534 361 

27.03701 

85.49854 

1367989 

732 

535 

824 

27.05550 

85.55700 

1366120 

733 

537 

289 

27.07397 

85.61542 

1364256 

734 

538 756 

27.09243 

85.67380 

1362398 

735 

540 225 

27.11088 

85.73214 

1360544 

736 

541 

696 

27.12932 

85.79044 

1358696 

737 

543 

169 

27.14774 

85.84870 

1356852 

738 

544 

644 

27.16616 

85.90693 

1355014 

739 

546 

121 

27.18455 

85.96511 

1353180 

740 

547 600 

27.20294 

86.02325 

1351351 

741 

549 

081 

27.22132 

86.08136 

1349528 

742 

550 

564 

27.23968 

86.13942 

1347709 

743 

552 

049 

27.25803 

86.19745 

1345895 

744 

553 

536 

27.27636 

86.25543 

1344086 

745 

555 025 

27.29469 

86.31338 

1342282 

746 

556 

516 

27.31300 

86.37129 

1340483 

747 

558 

009 

27.33130 

86.42916 

1338688 

748 

559 

504 

27.34959 

86.48699 

1336898 

749 

561 

001 

27.36786 

86.54479 

1335113 

750 

562 

500 

27.38613 

86.60254 

1333333 


N 

N * 

Vn 

VTojv’ 

1/N 

,00 

750 

562 

500 

27.38613 

86.60254 

1333333 

751 

564 

001 

27.40438 

86.66026 

1331558 

752 

565 

504 

27.42262 

86.71793 

1329787 

753 

567 

009 

27.44085 

86.77557 

1328021 

754 

568 

516 

27.45906 

86.83317 

1326260 

755 

570 

025 

27.47726 

86.89074 

1324503 

756 

571 

536 

27.49545 

86.94826 

1322751 

757 

573 

049 

27.51363 

87.00575 

1321004 

758 

574 

564* 

27.53180 

87.06320 

1319261 

759 

576 

081 

27.54995 

87.12061 

1317523 

760 

577 

600 

27.56810 

87.17798 

1315789 

761 

579 

121 

27.58623 

87.23531 

1314060 

762 

580 

644 

27.60435 

87.29261 

1312336 

763 

582 

169 

27.62245 

87.34987 

1310616 

764 

583 

696 

27.64055 

87.40709 

1308901 

765 

585 

225 

27.65863 

87.46428 

1307190 

766 

586 

756 

27.67671 

87.52143 

1305483 

767 

588 

289 

27.69476 

87.57854 

1303781 

768 

589 

824 

27.71281 

87.63561 

1302083 

769 

591 

361 

27.73085 

87.69265 

1300390 

770 

592 900: 

27.74887 

87.74964 

1298701 

771 

594 

441 

27.76689 

87.80661 

1297017 

772 

595 

984 

27.78489 

87.86353 ! 

1295337 

773 

597 

529 

27.80288 

87.92042 

1293661 

774 

599 

076 

27.82086 

87.97727 

1291990 

775 

600 

625 

27.83882 

88.03408 

1290323 

776: 

602 

176 

27.85678 

88.09086 

1288660 

777 

603 

729 

27.87472 

88.14760 

1287001 

778 

605 

284 

27.89265 

88.20431 

1285347 

779 

606 

841 

27.91057 

88.26098 

1283697 

780 

608 400 

27,92848 

88.31761 

1282051 

781 

609 961 

27.94638 

88.37420 

1280410 

782 

1611 

524 

27.96426 

88.43076 

1278772 

783 

j613 

089 

27.98214 

88.48729 

1277139 

784 

614 656 

; 28.00000 

88.54377 

1275510 

785 

616 225 

28.01785 

88.60023 

1273885 

786 

617 

796 

28.03569 

88.65664 

1272265 

787 

619 

369 

28.05352 

88.71302 

1270648 

788 

620 

944 

28.07134 

88.76936 

1269036 

.789 

622 

521 

28.08914 

88.82567 

1267427 

790 

624 100 

28.10694 

88.88194 

1265823 

791 

625 681 

28.12472 

88.93818 

1264223 

792 

627 264 

28.14249 

88.99438 

1262626 

793 

628 849 

28.16026 

89.05055 

1261034 

794 

630 436 

28.17801 

89.10668 

1259446 

795 

632 025 

28.19574 

89.16277 

1257862 

796 

633 

616 

28.21347 

89.21883 

1256281 

797 

635 209 

28.23119 

89.27486 

1254705 

798 

636 

804 

28.24889 

89.33085 

1253133 

799 

638 401 

28.26659 

89.38680 

1251564 

800 

640 000 

28.28427 

89.44272 

1250000 
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STATISTICS FOR BUSINESS DECISIONS 


SQUARES, SQUARE ROOTS, AND RECIPROCALS: 800-900 
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SQUARES, SQUARE ROOTS, AND RECIPROCALS: 900-1000 


N 

N* 

Vn 

Vion 

1/N 

.00 

900 

810 00C 

30.00000 

94.86833 

nimi 

901 

811 801 

30.01666 

94.92102 

1109878 

902 

813 604 

30.03331 

94.97368 

1108647 

903 

815 40? 

30.04996 

95.02631 

1107420 

904 

817 2l€ 

30.06659 

95.07891 

1106195 

905 

819 025 

30.08322 

95.13149 

1104972 

906 

820 836 

30.09983 

95.18403 

1103753 

907 

822 649 

30.11644 

95.23655 

1102536 

908 

824 464 

30.13304 

95.28903 

1101322 

909 

826 281 

30.14963 

95.34149 

1100110 

910 

828 100 

30.16621 

95.39392 

1098901 

911 

829 921 

30.18278 

95.44632 

1097695 

912 

831 744 

30.19934 

95.49869 

1096491 

913 

833 569 

30.21589 

95.55103 

1095290 

914 

835 396 

30.23243 

95.60335 

1094092 

915 

837 225 

30.24897 

95.65563 

1092896 

916 

839 056 

30.26549 

95.70789 

1091703 

917 

840 889 

30.28201 

95.76012 

1090513 

918 

842 724 

30.29851 

95.81232 

1089325 

919 

844 561 

30.31501 

95.86449 

1088139 

920 

846 400 

30.33150 

95.91663 

1086957 

921 

848 241 

30.34798 

95.96874 

1085776 

922 

850 084 

30.36445 

96.02083 

1084599 

923 

851 929 

30.38092 

96.07289 

1083424 

924 

853 776 

30.39737 

96.12492 

1082251 

925 

855 625 

30.41381 

96.17692 

1081081 

926 

857 476 

30.43025 

96.22889 

1079914 

927 

859 329 

30.44667 

96.28084 

1078749 

928 

861 184 

30.46309 

96.33276 

1077586 

929 

863 041 

30.47950 

96.38465 

1076426 

930 

864 900 

30.49590 

96.43651 

1075269 

931 

866 761 

30.51229 

96.48834 

1074114 

932 

868 624 

30.52868 

96.54015 

1072961 

933 

870 489 

30.54505 

96.59193 

1071811 

934 

872 356 

30.56141 

96.64368 

1070664 

935 

874 225 

30.57777 

96.69540 

1069519 

936 

876 096 

30.59412 

96.74709 

1068376 

937 

877 969 

30.61046 

96.79876 

1067236 

938 ; 

879 844 

30.62679 

96.85040 

1066098 

939 : 

881 721 

30.64311 

96.90201 

1064963 

940 

883 600 

30.65942 

96.95360 

1063830 

941 

885 481 

30.67572 

97.00515 

1062699 

942 

B87 364 

30.69202 

97.05668 

1061571 

943 

889 249 

30.70831 

97.10819 

1060445 

944 

891 136 

30.72458 

97.15966 

1059322 

945 1 

893 025 

30.74085 

97.21111 

1058201 

946 1 

394 916 

30.75711 

97.26253 

1057082 

947 ! 

396 809 

30.77337 

97.31393 

1055966 

948 1 

398 704 

30.78961 

97.36529 

1054852 

949 i 

J00 601 

30.80584 

97.41663 

1053741 

950 < 

)02 5001 

30.82207 | 

97.46794 

1052632 


N 

N* 

Vn 

vm 

1/N 

.00 

95C 

902 50( 

30.8220i 

97.46794 

1052632 

951 

904 401 

30.83821 

97.5192C 

1051525 

952 

906 304 

30.8545C 

97.57041 

1050420 

953 

908 201 

30.8707C 

97.62172 

1049318 

954 

910.11C 

30.88681 

97.67292 

1048218 

955 

912 025 

30.90307 

97.7241C 

1047120 

956 

913 936 

30.91925 

97.77525 

1046025 

957 

915 849 

30.93542 

97.82638 

1044932 

958 

917 764 

30.95158 

97.87747 

1043841 

959 

919 681 

30.96773 

97.92855 

1042753 

960 

921 600 

30.98387 

97.97959 

1041667 

961 

923 521 

31.00000 

98.03061 

1040583 

962 

925 444 

31.01612 

98.08160 

1039501 

963 

927 369 

31.03224 

98.13256 

1038422 

964 

929 296 

31.04835 

98.18350 

1037344 

965 

931 225 

31.06445 

98.23441 

1036269 

966 

933 156 

31.08054 

98.28530 

1035197 

967 

935 089 

31.09662 

98.33616 

1034126 

968 

937 024 

31.11270 

98.38699 

1033058 

969 

938 961 

31.12876 

98.43780 

1031992 

970 

940 900 

31.14482 

98.48858 

1030928 

971 

942 841 

31.16087 

98.53933 

1029866 

972 

944 784 

31.17691 

98.59006 

1028807 

973 

946 729 

31.19295 

98.64076 

1027749 

974 

948 676 

31,20897 

98.69144 

1026694 

975 

950 625 

31.22499 

98.74209 

1025641 

976 

952 576 

31.24100 

98.79271 

1024590 

977 

954 529 

31.25700 

98.84331 

1023541 

978 

956 484 

31.27299 

98.89388 

1022495 

979 

958 441 

31.28898 

98.94443 

1021450 

980 

960 400 

31.30495 

98.99495 

1020408 

981 

962 361 

31.32092 

99.04544 

1019368 

982 

964 324 

31.33688 

99.09591 

1018330 

983 

966 289 

31.35283 

99.14636 

1017294 

984 

968 256 

31.36877 

99.19677 

1016260 

985 

970 225 

31.38471 

99.24717 

1015228 

986 

972 196 

31.40064 

99.29753 

1014199 

987 

974 169 

31.41656 

99.34787 

1013171 

988 

976 144 

31.43247 

99.39819 

1012146 

989 

978 121 

31.44837 

99.44848 

1011122 

990 

980 100 

31.46427 

99.49874 

1010101 

991 

982 081 

31.48015 

99.54898 

1009082 

992 

984 064 

31.49603 

99.59920 

1008065 

993 

986 049 

31.51190 

99.64939 

1007049 

994 

988 036 

31.52777 

99.69955 

1006036 

995 

990 025 

31.54362 

99.74969 

1005025 

996 

992 016 

31.55947 

99.79980 

1004016 

997 

994 009 

31.57531 

99.84989 

1003009 

998 

996 004 

31.59114 

99.89995 

1002004 

999 

998 001 

31.60696 

99.94999 

1001001 

1000 : 

L 000 000 

31.62278 ; 

LOO.00000 

1000000 


APPENDIX • B 


Areas under the Normal Curve 
and Normal Deviates 

The two tables given on the following pages are used to measure 
areas (relative frequencies or probabilities) under the normal curve. 

Table B.l shows, for the value of a given normal deviate (z), the 
area in the tail of a normal curve; that is, it indicates the area de¬ 
picted below: 
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we simply subtract the appropriate value shown in Table B.l from 
.5000. 

Table B.2 shows, for a given tail area, the value of the correspond¬ 
ing normal deviate. 


TABLE B.l. AREAS UNDER THE NORMAL CURVE 


Nor ¬ 

mal 

De ¬ 

viate 

z 

.00 

.01 

.02 

.03 

.04 

.05 

.06 

.07 

.08 

.09 

0.0 

0.1 

0.2 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

0.9 

1.0 

1.1 

1.2 

1.3 

1.4 

1.5 

1.6 

1.7 

1.8 

1.9 
2.0 
2.1 
2.2 

2.3 

2.4 

2.5 

2.6 

2.7 

2.8 

2.9 
3.0 

.5000 

.4602 

.4207 

.3821 

.3446 

.3085 

.2743 

.2420 

.2119 

.1841 

.1587 

.1357 

.1151 

.0968 

.0808 

.0668 

.0548 

.0446 

.0359 

.0287 

.0228 

.0179 

.0139 

.0107 

.0082 

.0062 

.0047 

.0035 

.0026 

.0019 

.0013 

.4960 

.4562 

.4168 

.3783 

.3409 

.3050 

.2709 

.2389 

.2090 

.1814 

.1562 

.1335 

.1131 

.0951 

.0793 

.0655 

.0537 

.0436 

.0351 

.0281 

.0222 

.0174 

.0136 

.0104 

.0080 

.0060 

.0045 

.0034 

.0025 

.0018 

.0013 

.4920 

.4522 

.4129 

.3745 

.3372 

.3015 

.2676 

.2358 

.2061 

.1788 

.1539 

.1314 

.1112 

.0934 

.0778 

.0643 

.0526 

.0427 

.0344 

.0274 

.0217 

.0170 

.0132 

.0102 

.0078 

.0059 

.0044 

.0033 

.0024 

.0018 

.0013 

.4880 

.4483 

.4090 

.3707 

.3336 

.2981 

.2643 

.2327 

.2033 

.1762 

.1515 

.1292 

.1093 

.0918 

.0764 

.0630 

.0516 

.0418 

.0336 

.0268 

.0212 

.0166 

.0129 

.0099 

.0075 

.0057 

.0043 

.0032 

.0023 

.0017 

.0012 

.4840 

.4443 

.4052 

.3669 

.3300 

.2946 

.2611 

.2296 

.2005 

.1736 

.1492 

.1271 

.1075 

.0901 

.0749 

.0618 

.0505 

.0409 

.0329 

.0262 

.0207 

.0162 

.0125 

.0096 

.0073 

.0055 

.0041 

.0031 

.0023 

.0016 

.0012 

.4801 

.4404 

.4013 

.3632 

.3264 

.2912 

.2578 

.2266 

.1977 

.1711 

.1469 

.1251 

.1056 

.0885 

.0735 

.0606 

.0495 

.0401 

.0322 

.0256 

.0202 

.0158 

.0122 

.0094 

.0071 

.0054 

.0040 

.0030 

.0022 

.0016 

.0011 

.4761 

.4364 

.3974 

.3594 

.3228 

.2877 

.2546 

.2236 

.1949 

.1685 

.1446 

.1230 

.1038 

.0869 

.0721 

.0594 

.0485 

.0392 

.0314 

.0250 

.0197 

.0154 

.0119 

.0091 

.0069 

.0052 

.0039 

.0029 

.0021 

.0015 

.0011 

.4721 

.4325 

.3936 

.3557 

.3192 

.2843 

.2514 

.2206 

.1922 

.1660 

.1423 

.1210 

.1020 

.0853 

.0708 

.0582 

.0475 

.0384 

.0307 

.0244 

.0192 

.0150 

.0116 

.0089 

.0068 

.0051 

.0038 

.0028 

.0021 

.0015 

.0011 

.4681 

.4286 

.3897 

.3520 

.3156 

.2810 

.2483 

.2177 

.1894 

.1635 

.1401 

.1190 

.1003 

.0838 

.0694 

.0571 

.0465 

.0375 

.0301 

.0239 

.0188 

.0146 

.0113 

.0087 

.0066 

.0049 

.0037 

.0027 

.0020 

.0014 

.0010 

.4641 

.4247 

.3859 

.3483 

.3121 

.2776 

.2451 

.2148 

.1867 

.1611 

.1379 

.1170 

.0985 

.0823 

.0681 

.0559 

.0455 

.0367 

.0294 

.0233 

.0183 

.0143 

.0110 

.0084 

.0064 

.0048 

.0036 

.0026 

.0019 

.0014 

.0010 
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STATISTICS FOR BUSINESS DECISIONS 


TABLE B.2. NORMAL DEVIATES 


Single - 
Tail Area 

Two - 
Tail Area 

z 

Single - 
Tail Area 

Two - 
Tail Area 


.0025 

.005 

2.81 

.06 

.12 

1.55 

.005 

.010 

2.58 

.07 

.14 

1.48 

.010 

.020 

2.33 

.08 

.16 

1.41 

.0125 

.025 

2.24 

.09 

.18 

1.34 

.015 

.030 

2.17 

.10 

.20 

1.28 

.020 

.040 

2.05 

.20 

.40 

.84 

.025 

.050 

1.96 

.25 

.50 

.67 

.030 

.060 

1.88 

.30 

.60 

.52 

.040 

.050 

.080 

.100 

1.75 

1.64 

.40 

.50 

.80 

1.00 

.25 

0 







APPENDIX - C 


Table of 5,000 Random Digits* 


A table of 5,000 random digits is reproduced on the following 
pages. As indicated in Section 4.7, random digits are useful to draw 
probability samples. The digits given come from a much larger table 
compiled for use in sampling studies by The Rand Corporation. 


RANDOM DIGITS 


Line 





Column Number 




No. 

1-5 

6-10 

11-15 

16-20 

21-25 

26-30 

31-35 

36-10 

41-45 

46-50 

0 

93108 

77033 

68325 

10160 

38667 

62441 

87023 

94372 

06164 

30700 

1 

28271 

08589 

83279 

48838 

60935 

70541 

53814 

95588 

05832 

80235 

2 

21841 

35545 

11148 

34775 

17308 

88034 

97765 

35959 

52843 ' 

44895 

3 

22025 

79554 

19698 

25255 

50283 

94037 

57463 

92925 

12042 

91414 

4 

09210 

20779 

02994 

02258 

86978 

85092 

54052 

18354 

20914 

28460 

5 

90552 

71129 

03621 

20517 

16908 

06668 

29916 

51537 

93658 

29525 

6 

01130 

06995 

20258 

10351 

99248 

51660 

38861 

49668 

74742" 

47181 

7 

22604 

56719 

21784 

68788 

38358 

59827 

19270 

99287 

81193 

43366 

8 

06690 

01800 

34272 

65497 

94891 

14537 

91358 

21587 

95765 

72605 

9 

59809 

69982 

71809 

64984 

48709 

43991 

24987 

69246 

86400 

...... ..,T 

29559 | 

10 

56475 

02726 

58511 

95405 

70293 

84971 

06676 

44075 

32338 

31980 ! 

11 

02730 

34870 

83209 

03138 

07715 

31557 

55242 

61308 

26507 

06186 ' 

12 

74482 

33990 

13509 

92588 

10462 

76546 

46097 

01825 

20153 

36271 

13 

19793 

22487 

94238 

81054 

95488 

23617 

15539 

94335 

73822 

93481 

14 

19020 

27856 

60526 

24144 

98021 

60564 

46373 

86928 

52135 

74919 

15 

69565 

60635 

65709 

77887 

42766 

86698 

14004 

94577 

27936 

47220 

16 

69274 

23208 

61035 

84263 

15034 

28717 

76146 

22021 

23779 

98562 

17 

83658 

14204 

09445 

41081 

49630 

34215 

89806 

40930 

97194 

21747 

18 

78612 

51102 

66826 

40430 

54072 

62164 

68977 

95583 

11765 

81072 

19 

14980 

74158 

78216 

38985 

60838 

82836 

42777 

85321 

90463 

11813 


* From The Rand Corporation, A Million Random Digits (Glencoe, ill.: The 
Free Press, 1955), by permission. 
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RANDOM DIGITS 


Line 




Column Number 





No. 

1-5 

6-10 

11-15 

16-20 

21-25 

26-30 

31-35 

36-40 

41-45 

46-50 

20 

63172 

28010 

29405 

91554 

75195 

51183 

65805 

87525 

35952 

83204 

21 

71167 

37984 

52737 

06869 

38122 

95322 

41356 

19391 

96787 

64410 

22 

78530 

56410 

19195 

34434 

83712 

50397 

80920 

15464 

8i350 

18673 

23 

98324 

03774 

07573 

67864 

06497 

20758 

83454 

22756 

83959 

96347 

24 

55793 

30055 

08373 

32652 

02654 

75980 

02095 

87545 

88815 

80086 

25 

05674 

34471 

61967 

91266 

38814 

44728 

32455 

17057 

08339 

93997 

26 

15643 

22245 

07592 

22078 

73628 

60902 

41561 

54608 

41023 

98345 

27 

66750 

19609 

70358 

03622 

64898 

82220 

69304 

46235 

97332 

64539 

28 

42320 

74314 

50222 

82339 

51564 

42885 

50482 

98501 

02245 

88990 

29 

73752 

73818 

15470 

04914 

24936 

65514 

56633 

72030 

30856 

85183 

30 

97546 

02188 

46373 

21486 

28221 

08155 

23486 

66134 

88799 

49496 

31 

32569 

52162 

38444 

42004 

78011 

16909 

94194 

79732 

47114 

23919 

32 

36048 

93973 

82596 

28739 

86985 

58144 

65007 

08786 

14826 

04896 

33 

40455 

36702 

38965 

56042 

80023 

28169 

04174 

65533 

52718 

55255 

34 

33597 

47071 

55618 

51796 

71027 

46690 

08002 

45066 

02870 

60012 

35 

22828 

96380 

35883 

15910 

17211 

42358 

14056 

55438 

98148 

35384 

36 

00631 

95925 

19324 

31497 

88118 

06283 

84596 

72091 

53987 

01477 

37 

75722 

36478 

07634 

63114 

27164 

15467 

03983 

09141 

60562 

65725 

38 

80577 

01771 

61510 

17099 

28731 

41426 

18853 

41523 

14914 

76661 

39 

10524 

20900 

65463 

83680 

05005 

11611 

64426 

59065 

06758 

02892 

40 

93815 

69446 

75253 

51915 

97839 

75427 

90685 

60352 

96288 

34248 

41 

81867 

97119 

93446 

20862 

46591 

97677 

42704 

13718 

44975 

67145 

42 

64649 

07689 

16711 

12169 

15238 

74106 

60655 

56289 

74166 

78561 

43 

55768 

09210 

52439 

33355 

57884 

36791 

00853 

49969 

74814 

09270 

44 

38080 

49460 

48137 

61589 

42742 

92035 

21766 

19435 

92579 

27683 

45 

22360 

16332 

05343 

34613 

24013 

98831 

17157 

44089 

07366 

66196 

46 

40521 

09057 

00239 

51284 

71556 

22605 

41293 

54854 

39736 

05113 

47 

19292 

69862 

59951 

49644 

53486 

28244 

20714 

56030 

39292 

45166 

48 

79504 

40078 

06838 

05509 

68581 

39400 

85615 

52314 

83202 

40313 

49 

64138 

27983 

84048 

42631 

58658 

62243 

82572 

45211 

37060 

15017 

50 

31000 

63837 

17813 

08076 

19164 

95508 

17513 

29416 

61238 

25818 

51 

87014 

83331 

56364 

32768 

85680 

08844 

59844 

68794 

32783 

60318 

52 

47293 

97023 

35804 

69886 

47494 

94574 

45842 

67221 

77115 

43398 

53 

33898 

89236 

06100 

68848 

08674 

87786 

42425 

92091 

86274 

82166 

54 

26877 

95856 

65227 

25165 

01752 

99463 

15216 

28719 

04716 

80246 

55 

53984 

87855 

70753 

80386 

78600 

39244 

76967 

83263 

57849 

85890 

56 

97809 

03548 

00574 

21143 

11605 

30245 

87395 

80966 

28721 

11095 

57 

85683 

79483 

74858 

87491 

57785 

61270 

51111 

50490 

40940 

02832 

58 

07758 

57784 

22934 

17165 

37776 

33361 

79191 

97398 

40881 

98552 

59 

13995 

74006 

90843 

85761 

89037 

13567 

67089 

47435 

75156 

70217 

60 

89630 

94575 

64517 

80897 

04861 

50564 

15287 

94279 

69154 

75473 

61 

20355 

08661 

11092 

19682 

84287 

23597 

48246 

74325 

66320 

90155 

62 

13496 

74367 

32701 

87819 

29050 

90959 

99765 

59374 

45204 

87750 

63 

68419 

49317 

30340 

62744 

45214 

79826 

34043 

73218 

72788 

26143 

64 

61481 

09786 

08768 

93715 

43847 

68901 

51249 

35382 

67776 

06271 
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RANDOM DIGITS 


Line 





Column Number 




No. 

1-5 

6-10 

11-15 

16-20 

21-25 

26-30 

31-35 

36-40 

41-45 

46-50 

65 

71877 

53924 

09630 

62975 

64470 

46832 

40591 

23477 

21063 

31229 

66 

99288 

86530 

86819 

38955 

91744 

89632 

64623 

84986 

85990 

47639 

67 

09047 

83453 

12775 

32000 

60098 

98862 

93422 

66212 

86413 

82884 

68 

83724 

89682 

53950 

41500 

16023 

83862 

88274 

68301 

99673 

40945 

69 

56126 

46618 

87708 

91994 

38384 

27047 

91550 

18903 

45535 

45586 

70 

13810 

51503 

55122 

22997 

82556 

22697 

54985 

64852 

58775 

66737 

71 

85630 

22794 

81623 

10927 

52252 

27384 

03122 

17454 

03250 

23232 

72 

93350 

28643 

39097 

00914 

69844 

06450 

32818 

88752 

28722 

94656 

73 

05002 

71178 

15414 

99874 

58046 

35461 

56349 

81936 

14964 

83638 

74 

70358 

57182 

45747 

30830 

70542 

25932 

16298 

23521 

72454 

11640 

75 

98169 

51114 

18115 

40718 

46082 

75847 

15678 

22842 

44615 

97810 

76 

28010 

95831 

94028 

57041 

75137 

55128 

99785 

33467 

33834 

04860 

77 

05576 

47156 

09300 

11197 

97670 

20064 

98145 

84774 

07907 

10992 

78 

80748 

66311 

79421 

00202 

68501 

40422 

07368 

26795 

24358 

78969 

79 

39746 

23690 

67845 

83962 

09451 

21501 

75317 

09049 

69440 

16137 

80 

39380 

72397 

88106 

07851 

67756 

58042 

44218 

52775 

49082 

54400 

81 

55865 

70318 

58189 

35340 

91500 

80940 

39231 

54836 

18038 

03557 

82 

34223 

72744 

83926 

40078 

80791 

42723 

68340 

47452 

17443 

69289 

83 

91312 

44474 

09925 

41416 

96671 

11213 

60979 

04130 

72380 

73582 

84 

51010 

08021 

53914 

37499 

40228 

92606 

07225 

18014 

71063 

50111 

85 

48890 

58264 

18790 

91535 

04933 

56656 

76389 

37904 

98017 

07663 

86 

24876 

93198 

77444 

53782 

48866 

65614 

11410 

90637 

63675 

43554 

87 

49408 

42923 

28162 

09789 

15155 

66742 

66785 

79065 

93573 

19853 

88 

56307 

90901 

18118 

36505 

01598 

68997 

96338 

40823 

96247 

55573 

89 

48320 

09322 

13457 

84931 

54701 

14878 

05422 

65358 

85636 

31948 

90 

51083 

00619 

05548 

25431 

15175 

82428 

00637 

41814 

68871 

31688 

91 

43249 

71593 

85595 

59834 

01488 

06917 

32858 

80134 

01832 

25905 

92 

00370 

17413 

75537 

79824 

24428 

28941 

58659 

66731 

99940 

27156 

93 

55737 

45462 

12484 

64858 

43581 

06220 

07507 

39119 

38024 

99720 

94 

39551 

38058 

10445 

18463 

80812 

51243 

22351 

63266 

94057 

06573 

95 

02369 

11289 

99499 

32922 

88429 

90484 

42010 

53308 

33206 

36137 

96 

92788 

62546 

26147 

83529 

10012 

77611 

78925 

86071 

66344 

35705 

97 

98650 

95898 

74254 

45173 

17430 

73882 

03411 

88447 

43279 

35057 

98 

95591 

85858 

51058 

26140 

00995 

16881 

87372 

72646 

18796 

58537 

99 

67853 

23563 

41063 

63355 

72454 

16016 

72229 

54720 

09846 

81392 





APPENDIX • D 


Table of Common Logarithms, 100-999 


The table on the following two pages gives the common loga¬ 
rithms of the numbers 100 through 999. Logarithms may be used to 
simplify the arithmetical operations of multiplication, division, 
raising numbers to powers, and extracting roots of numbers. Thus, 
the student in statistics may find them useful, for example, in the 
calculation of probabilities by the binomial formula. Calculations 
with the table of four-place logarithms given should yield results 
accurate to four digits. 

For a discussion of the use of logarithms the student may consult 
almost any elementary college mathematics textbook. For example, 
see 

Cooley, Hollis R., Gans, David, Kline, Morris, and Wahlert, How¬ 
ard E. Introduction to Mathematics, pp. 123-136. 2nd ed. Boston: 
Houghton Mifflin Company, 1949. 
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COMMON LOGARITHMS: 100-549 


N 

0 

1 

2 

3 

4 

5 

6 

7 

8 

- - 

9 

wm 

0000 

0043 

0086 

0128 

0170 

0212 

0253 

0294 

0334 

0374 

mm 

0414 

0453 

0492 

0531 

0569 

0607 

0645 

0682 

0719 

0755 

139 

0792 

0828 

0864 

0899 

0934 

0969 

1004 

1038 

1072 

1106 

EH 

1139 

1173 

1206 

1239 

1271 

1303 

1335 

1367 

1399 

1430 

14 

1461 

1492 

1523 

1553 

1584 

1614 

1644 

1673 

1703 

1732 

15 

1761 

1790 

1818 

1847 

1875 

1903 

1931 

1959 

1987 

2014 

16 

2041 

2068 

2095 

2122 

2148 

2175 

2201 

2227 

2253 

2279 

17 

2304 

2330 

2355 

2380 

2405 

2430 

2455 

2480 

2504 

2529 

18 

2553 

2577 

2601 

2625 

2648 

2672 

2695 

2718 

2742 

2765 

19 

2788 

2810 

2833 

2856 

2878 

2900 

2923 

2945 

2967 

2989 

20 

3010 

3032 

3054 

3075 

3096 

3118 

3139 

3160 

3181 

3201 

21 

3222 

3243 

3263 

3284 

3304 

3324 

3345 

3365 

3385 

3404 

22 

3424 

3444 

3464 

3483 

3502 

3522 

3541 

3560 

3579 

3598 

23 

3617 

3636 

3655 

3674 

3692 

3711 

3729 

3747 

3766 

3784 

24 

3802 

3820 

3838 

3856 

3874 

3892 

3909 

3927 

3945 

3962 

25 

3979 

3997 

4014 

4031 

4048 

4065 

4082 

4099 

4116 

4133 

26 

4150 

4166 

4183 

4200 

4216 

4232 

4249 

4265 

4281 

4298 

27 

4314 

4330 

4346 

4362 

4378 

4393 

4409 

4425 

4440 

4456 

28 

4472 

4487 

4502 

4518 

4533 

4548 

4564 

4579 

4594 

4609 

29 

4624 

4639 

4654 

4669 

4683 

4698 

4713 

4728 

4742 

4757 

30 

4771 

4786 

4800 

4814 

4829 

4843 

4857 

4871 

4886 

4900 

31 

4914 

4928 

4942 

4955 

4969 

4983 

4997 

5011 

5024 

5038 

32 

5051 

5065 

5079 

5092 

5105 

5119 

5132 

5145 

5159 

5172 

33 

5185 

5198 

5211 

5224 

5237 

5250 

5263 

5276 

5289 

5302 

34 

5315 

5328 

5340 

5353 

5366 

5378 

5391 

5403 

5416 

5428 

35 

5441 

5453 

5465 

5478 

5490 

5502 

5514 

5527 

5539 

5551 

36 

5563 

5575 

5587 

5599 

5611 

5623 

5635 

5647 

5658 

5670 

37 

5682 

5694 

5705 

5717 

5729 

5740 

5752 

5763 

5775 

5786 

38 

5798 

5809 

5821 

5832 

5843 

5855 

5866 

5877 

5888 

5899 

39 

5911 

5922 

5933 

5944 

5955 

5966 

5977 

5988 

5999 

6010 

40 

6021 

6031 

6042 

6053 

6064 

6075 

6085 

6096 

6107 

6117 

41 

6128 

6138 

6149 

6160 

6170 

6180 

6191 

6201 

6212 

6222 

42 

6232 

6243 

6253 

6263 

6274 

6284 

6294 

6304 

6314 

6325 

43 

6336 

6345 

6355 

6365 

6375 

6385 

6395 

6405 

6415 

6425 

44 

6435 

6444 

6454 

6464 

6474 

6484 

6493 

6503 

6513 

6522 

45 

6532 

6542 

6551 

6561 

6571 

6580 

6590 

6599 

6609 

6618 

46 

6628 

6637 

6646 

6656 

6665 

6675 

6684 

6693 

6702 

6712 

47 

6721 

6730 

6739 

6749 

6758 

6767 

6776 

6785 

6794 

6803 

48 

6812 

6821 

6830 

6839 

6848 

6857 

6866 

6875 

6884 

6893 

49 

6902 

6911 

6920 

6928 

6937 

6946 

6955 

6964 

6972 

6981 

EM 

6990 

6998 

7007 

7016 

7024 

7033 

7042 

7050 

7059 

7067 

51 

7076 

7084 

7093 

7101 

7110 

7118 

7126 

7135 

7143 

7152 

52 

7160 

7168 

7177 

7185 

7193 

7202 

7210 

7218 

7226 

7235 

53 

7243 

7251 

7259 

7267 

7275 

7284 

7292 

7300 

7308 

7316 

54 

7324 

7332 

7340 

7348 

7356 

7364 

7372 

7380 

7388 

7396 
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COMMON LOGARITHMS: 550-999 


N 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

55 

7404 

7412 

7419 

7427 

7435 

7443 

7451 

7459 

7466 

7474 

56 

7482 

7490 

7497 

7505 

7513 

7520 

7528 

7536 

7543 

7551 

57 

7559 

7566 

7574 

7582 

7589 

7597 

7604 

7612 

7619 

7627 

58 

7634 

7642 

7649 

7657 

7664 

7672 

7679 

7686 

7694 

7701 

59 

7709 

7716 

7723 

7731 

7738 

7745 

7752 

7760 

7767 

7774 

60 

7782 

7789 

7796 

7803 

7810 

7818 

7825 

7832 

7839 

7846 

61 

7853 

7860 

7868 

7875 

7882 

7889 

7896 

7903 

7910 

7917 

62 

7924 

7931 

7938 

7945 

7952 

7959 

7966 

7973 

7980 

7987 

63 

7993 

8000 

8007 

8014 

8021 

8028 

8035 

8041 

8048 

8055 

64 

8062 

8069 

8075 

8082 

8089 

8096 

8102 

8109 

8116 

8122 

65 

8129 

8136 

8142 

8149 

8156 

8162 

8169 

8176 

8182 

8189 

66 

8195 

8202 

8209 

8215 

8222 

8228 

8235 

8241 

8248 

8254 

67 

8261 

8267 

8274 

8280 

8287 

8293 

8299 

8306 

8312 

8319 

68 

8325 

8331 

8338 

8344 

8351 

8357 

8363 

8370 

8376 

8382 

69 

8388 

8395 

8401 

8407 

8414 

8420 

8426 

8432 

8439 

8445 

70 

8451 

8457 

8463 

8470 

8476 

8482 

8488 

8494 

8500 

8506 

71 

8513 

8519 

8525 

8531 

8537 

8543 

8549 

8555 

8561 

8567 

72 

8573 

8579 

8585 

8591 

8597 

8603 

8609 

8615 

8621 

8627 

73 

8633 

8639 

8645 

8651 

8657 

8663 

8669 

8675 

8681 

8686 

74 

8692 

8698 

8704 

8710 

8716 

8722 

8727 

8733 

8739 

8745 

75 

8751 

8756 

8762 

8768 

8774 

8779 

8785 

8791 

8797 

8802 

76 

j 8808 

8814 

8820 

8825 

8831 

8837 

8842 

8848 

8854 

8859 

77 

8865 

8871 

8876 

8882 

8887 

8893 

8899 

8904 

8910 

8915 

78 

8921 

8927 

8932 

8938 

8943 

8949 

8954 

8960 

8965 

8971 

79 

8976 

8982 

8987 

8993 

8998 

9004 

9009 

9015 

9020 

9025 

80 

9031 

9036 

9042 

9047 

9053 

9058 

9063 

9069 

9074 

9079 

81 

9085 

9090 

9096 

9101 

9106 

9112 

9117 

9122 

9128 

9133 

82 

9138 

9143 

9149 

9154 

9159 

9165 

9170 

9175 

9180 

9186 

83 

9191 

9196 

9201 

9206 

9212 

9217 

9222 

9227 

9232 

9238 

84 

9243 

9248 

9253 

9258 

9263 

9269 

9274 

9279 

9284 

9289 

85 

9294 

9299 

9304 

9309 

9315 

9320 

9325 

9330 

9335 

9340 

86 

9345 

9350 

9355 

9360 

9365 

9370 

9375 

9380 

9385 

9390 

87 

9395 

9400 

9405 

9410 

9415 

9420 

9425 

9430 

9435 

9440 

88 

9445 

9450 

9455 

9460 

9465 

9469 

9474 

9479 

9484 

9489 

89 

9494 

9499 

9504 

9509 

9513 

9518 

9523 

9528 

9533 

9538 


9542 

9547 

9552 

9557 

9562 

9566 

9571 

9576 

9581 

9586 

91 

9590 

9595 

9600 

9605 

9609 

9614 

9619 

9624 

9628 

9633 

92 

9638 

9643 

9647 

9652 

9657 

9661 

9666 

9671 

9675 

9680 

93 

9685 

9689 

9694 

9699 

9703 

9708 

9713 

9717 

9722 

9727 

94 

9731 

9736 

9741 

9745 

9750 

9754 

9759 

9763 

9768 

9773 

95 

9777 

9782 

9786 

9791 

9795 

9800 

9805 

9809 

9814 

9818 

96 

9823 

9827 

9832 

9836 

9841 

9845 

9850 

9854 

9859 

9863 

97 

9868 

9872 

9877 

9881 

9886 

9890 

9894 

9899 

9903 

9908 

98 

9912 

9917 

9921 

9926 

9930 

9934 

9939 

9943 

9948 

9952 

99 

9956 

9961 

9965 

9969 

9974 

9978 

9983 

9987 

9991 

9996 







APPENDIX • E 


Tables of the t Distribution 
and the F Distribution 


Not all sampling distributions of pertinent statistics can be ap¬ 
proximated by the normal curve. In some situations statisticians 
must resort to the use of some other mathematical curve to approxi¬ 
mate a sampling distribution adequately. Two frequently useful 
curves or distributions of this sort are (i) the t distribution and (ii) 
the F distribution. 

Just as the normal curve depends upon its mean and its standard 
deviation, the t distribution depends upon a parameter—generally 
called degrees of freedom {ri). For one degree of freedom there is 
one t distribution, for two degrees of freedom there is another t 
distribution, for three degrees of freedom there is still another, and 
so forth ad infinitum. 

For various t distributions (i.e., for various degrees of freedom: 
1 to 30 and «>) Table E.l shows the value of t for a given area in 
both tails of the curve. This table, for t, is comparable to the table 
of normal deviates on page 502. 

The graphic illustration of a £ distribution on page 510 provides 
an example of the use of Table E.l. 

Just as there are many t distributions, there are many F distribu¬ 
tions—the shape of a particular F distribution depends upon a pair 
of degrees of freedom (% and n 2 ). Table E.2 shows, for various pairs 
of degrees of freedom, the value of F for a given area under the 
curve’s upper tail. 

The graphic illustration of an F distribution on page 510 provides 
an example of the use of Table E.2. 
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GRAPHIC ILLUSTRATIONS OF THE f DISTRIBUTION AND THE F DISTRIBUTION 



As Table E.l indicates, for the t distribution with 6 degrees of 
freedom, .05 of the area in two tails of the curve falls outside of the 
range t — —2.447 to t — 2.447. 



As Table E.2 indicates, for the F distribution with 4 and 20 de¬ 
grees of freedom, .05 of the area under the curve is past F — 2.860 
while .01 of the area is past F = 4.431. 



TABLE E.1. VALUES OF f 
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Index 


A 

A t , 346 

Absolute deviation, 70 
Absolute seasonal, 477 
Acceptable quality level (AQL), 258 
Acceptance number, 260, 264-65 
Acceptance sampling 
acceptable quality level ( AQL), 258 
acceptance number, 260, 264-65 
alternate management objectives, 
269-71 

alternate plans, 263-68 
average sample size, 266-67 
basic principles of, 261-62 
bias in selection procedure, 282 
consumer’s risk, 258 
decision rule, 259-61 
defined, 257 

double sampling plan, 263-64, 266-68 
fixed sample size, 263 
lot tolerance per cent defective 
(. LTPD ), 258 

minimum inspection costs, 269-70 
multiple sampling plan, 263-64, 

266-68 

producer’s risk, 258 

sequential sampling plan, 263, 265-68 
single sampling plan, 263-68 
truncated sampling plan, 263 
Accuracy, 276,303-5 
Action, principle of, 3-6 
Aggregate 

in analytical studies, 60 
as a decision-parameter, 58-62 
in enumerative studies, 60 
estimator for, 199 
formula for, 61-62 
of a population (A), 58-62 
Alpha risk (a-risk), 235 
Alternate hypothesis, 238 
Alternative courses of action, 5-6, 13 
American Management Association, 481 
American Societv for Qualitv Control, 
368 

American Society for Testing Ma¬ 
terials, 368 

American Standards Association, Inc,, 

47,368 


Analysis of variance 
assumptions, 388 
described, 387-91 
F distribution, 391,509 
fixed model, 391 
random model, 391 
variance ratio, 390-91 
Analytical study, 28-31, 48 
Applied Statistics, 299 
Arithmetic mean 
and the aggregate, 65-66 
in analytical studies, 66 
arithmetical properties, 66 
as an average of price relatives, 63-65 
computation from frequency distribu¬ 
tion, 313-16 

confidence interval for, 221-24 
as a decision-parameter, 62-66, 241-48 
in enumerative studies, 66 
estimator for, 199 
expected value, 150 
formula for, 66 
of a population, 62-66 
of a sample, 123-27 
sampling distribution, 175-89 
Array, 67 

Assignable causes, 336 
Association 
and causation, 406-8 
degree of relationship, 403-6 
dependent variable, 402 
independent variable, 402 
meaning of, 401-3 
multiple relationship, 402 
nature of relationship, 403-6 
simple relationship, 402 
Attribute, 32-36 
Automatic recording devices 
audimeter, 301 
photoelectric cell, 301 
program analyzer, 301 
Average; see Arithmetic mean 
Average deviation, 74-75 

B 

Backman, Jules, 94 
Ballowe, James M., 343 
Bancroft, Gertrude, 288 
Bar chart, 37 
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Bassie, V. Lewis, 486 

Beta risk (/3-risk), 235 

Bias; see Procedural bias 

Biased estimator, 203-4 

Binary numbers, 325 

Binomial formula, 144-47 

Bivariate population, 35-36, 402, 435 

Blair, Morris Myers, 121 

Bratt, Elmer Clark, 486 

Brenner, H., 95 

Brinkman, J. S., 344 

Bross, Irwin D. J., 19 

Brunk, Max E., 393 

Built-in deficiency, 276-77 

Burns, Arthur F., 478-79 

Burr, Irving W., 275 

C 

c-chart, 351-54 
Call-backs, 285, 298-300 
Causation and association, 406-8 
Cause and effect, 406-8 
Census, 26, 88-90, 99-100, 276 
Centile, 51 

Chance; see Probability, Risk 
Chance causes, 336 
Chapin, F. Stuart, 288 
Chappell, Matthew N., 300 
Characteristic, 22-25 
Chebycheff, Pafnutti Lvovitch, 83 
Chunk, 103 

Cluster sampling, 118, 209-10 
Cochran, William G., 160, 400 
Coding, 305, 307-8, 314-16, 318-19 
Coefficient of correlation, 432-38 
Coefficient of determination, 435 
Collator, 322, 323 

Columbia Statistical Research Group, 
264 

Column diagram, 39 
Common causes, 336 
Comparative experiments 
analysis of variance, 387-91 
defined, 370 
design, 382-84 

matched (paired) sample design, 382- 
86 

OC curve, 379 
several populations, 386-96 
two population means, 372-79 
two population proportions, 379-82 
Complete count, 12, 26; see also Census 
Complete information, 11, 21 
Completely randomized design, 393-94 
Conditional probability 
defined, 135 
dependent trials, 135 
rule, 135-37 
rule applied, 136-41 


Confidence coefficient, 221 
Confidence interval, 221-25, 427-28, 429 
Conklin, Maxwell R., 325 
Consequences, 6-10 
Consistency of data, 305, 457-58 
Consistent estimator, 199-201 
Constant seasonal, 477 
Consumer’s risk, 258 
Continuing decision problem, 332-34 
Continuous variate, 32-33 
Control chart 
choice of, 350-51 
control limits, 337 
as a decision rule, 337 
frequency of sample selection, 359-60 
historical development, 333-34 
for mean (5-chart), 344-48 
number of defects (c-chart), 351-54 
for proportion defective (p-chart), 
336-44 

p-chart versus x- and R -charts, 350-51 
E-chart, 344, 348-49 
rational subgroup, 358-60 
and risks of incorrect action 
probability limits, 355 
repeated sampling, 356-57 
3-sigma limits, 355 
types of, 355 
sample size, 360 
specification limits, 360-63 
3-sigma limits, 339,355 
as a time series, 447-48 
Controllable causes, 336 
Convenience sampling, 102-3, 105-7 
Correlation 

bivariate normal distribution, 435 
coefficient of determination, 435 
critical values for sample coefficient of, 
437 

estimator for coefficient of, 436-37 
model, 435 
nature of, 403-6 

population coefficient of, 432-35 
rank, 438-41 

sample coefficient of, 436-38 
Cowden, Dudley J., 121, 331, 368, 485 
Cox, Gertrude M., 400 
Cramer, Harald, 153 
Critical value, 238 

Croxton, Frederick E., 121, 331, 485 
Cyclical fluctuations, 454-55, 478-80 

D 

d 2 , 346 

D s and D i} 346 
Dalleck, Winston C., 195 
Data 

external, 92-96 
internal, 91-92 
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Data— Cont. 
primary, 90-96 
seasonally adjusted, 476-77 
secondary, 90-96 
Data accuracy 
extreme values, 306 
inconsistencies, 305 
unlikely values, 305-6 
unusual regularity, 306-7 
Data collection 

automatic recording devices, 301 
direct observation, 297 
mail questionnaire, 299-300 
organization of, 302-5 
panel technique, 300-301 
personal interview, 298-99 
telephone interview, 300 
Data flow, 296-331 
Data processing, 305-9 
Data tabulation, 309-13 
Decision making in the face of uncer¬ 
tainty, 2-6 
Decision-parameter 
defined, 51 

measures used as, 53-83, 372, 379 
sampling distribution of, 189-90 
selection of, 83-84 
states of the world, 51 
statistic contrasted with, 125-26 
Decision problems 
estimation, 12-14, 194-231 
nature of, 1—12 
testing, 12-14, 232-74 
Decision rule 

in acceptance sampling, 259-61 
alternate management objectives, 268- 

71 

in comparative experiments, 374-75, 
380-81 

concept of, 15-16 

in correlation problems, 437-38, 440 
to discard data, 306 
in estimation problems, 197-9ff 
operating characteristic curve, 248-57 
in regression problems, 420-28 
in testing problems, 236-48 
Deming, W. Edwards, 48, 49, 208,294, 331 
De Moivre, Abraham, 78 
Dependent variable, 402 
Design of experiments 
completely randomized, 382 
efficiency of, 391 

matched (paired) samples, 382-86 
matched sample and completely ran¬ 
domized design contrasted, 386 
nature of, 371-72 
types of design 

completely randomized, 393-94 


Design of experiments— Cont . 
types of design— Cont, 

Latin Square, 395-96 
randomized block, 394-95 
Detlefsen, G. R., 95 
Differences 
of means, 372-73 
of proportions, 379-80 
of successive observations, 458-59 
Discrete variate, 32 
Dispersion; see Variability 
Distribution, population, estimation of, 
225-28 

Distribution, sampling, 154-93 
Double sampling, 118-19 
Double sampling plan, 263-64, 266-68 
Duncan, Acheson J., 153, 274, 368, 399 

E 

Editing, 305 
Edwards, G. D., 48 
Efficient estimator, 203-5 
Electronic computer, 323-26 
Elementary unit, 22-25 
Enumerative study, 28-31,48 
Equiprobable assumption, 130-31 
Estimation 

attained precision, 217-20 
decision rule, 197-98 
determining sample size 
finite population, 214-15, 216 
infinite population, 212-14, 216 
estimator, 198-207 
interval estimates, 197, 220-25 
point estimate, 197 
of population distribution, 225-28 
problems, 12-14, 195-97, 210-17 
uses of, 194-96 
Estimator 
for aggregate, 199 
for arithmetic mean, 199 
biased, 203 
consistent, 199-201 
defined, 198 
efficiency of, 203-5 
minimax, 205-7 
for proportion, 199 
ratio, 203-4 
unbiased, 201-4 
for variance, 201-2, 218-19 
Expected value, 148-50, 161-62, 184-85 
Exponential distribution, 81-82 
Extreme values, 68-70, 306 

F 

F distribution, use in analysis of vari¬ 
ance, 391 
F, tables of, 509 
Factorials, 101 
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Federer, Walter T., 393 
Field, 320 

Final report, 326-28 
Finite population, 29-30 
Finite population correction factor, 
170-72,187 

Fisher, Sir Ronald, 76, 371,400 
Fisher, Willard C., 64 
Forecasting, 446-86 
Fraction defective, 339 
Frame, 25-28,113-14 
Free-response question, 307 
Frequency distribution 
of attributes, 36-37 
bimodal, 44-45 
class intervals of, 310-12 
class limits of, 309-10 
classes of, 309, 311-12 
column diagram of, 39 
construction of, 309-12 
defined, 36 

histogram of, 39, 41-45 
J-shaped, 43-44 
multimodal, 44 
relative, 39-40 

skewed or asymmetrical, 42-44 
as a state of the world, 52-53 
symmetrical, 41-42 
unimodal, 44 
U-shaped, 44 
of variates, 37-40 
Frickey, Edwin, 478 

G 

Galton, Sir Francis, 67 
Gauss, Karl Frederick, 78 
Gaussian distribution, 78; see also Nor¬ 
mal distribution 
Gini, Corrado, 1 
Grant, Eugene L., 48,274, 368 
Gray, Percy G., 290,300 
Gretton, Owen C., 325 
Grouped data, 309 
Grubbs, F. E., 259 

H 

Hansen, Morris H., 295 

Hirsch, Werner Z., 87, 153, 193, 230, 445 

Histogram, 39,41-45 

Hochstim, Joseph R., 278 

Hooper, C. E., 300 

Hooperating, 300 

Huff, Darrell, 281, 282, 295 

Hurwitz, William N., 295 

Hypergeometric distribution, 170 

Hypothesis, 237-38 

I 

Incomplete information, 10 
Independent trials, 141-47 


Independent variable, 402 
Industrial Quality Control , 368 
Infinite population, 29-30 
Inspection, 257 
Interpreter, 322 

Interval estimation, 197, 220-25 
Interviewer 
bias, 288-89 

selection, training, and supervision, 
303-4 

J 

Judgment sampling, 102-4 

K 

Kellv, Truman L., 86 
Kendall, M. G., 49,112 
Kev-punch machine, 322 
Kimball, A. W., 295 
Kish, Leslie, 291 
Kroll, Frank W., Jr., 366 

L 

Lansing, John, 291 
Latin Square design, 395-96 
LCL, 337 

Least-squares method, 417-18 
Legendre, Adrien, 417 
Lewis, E. E., 193 
Logarithmic scale, 451-52 
Logarithms, table of, 506-8 
Lot, definition of, 257 
Lot tolerance per cent defective 
(LTPD), 258 

Lower control limit {LCL), 337 
LTPD, 258 

M 

Machine tabulation, 319-26 
Mail questionnaire, 299 
Mandel, B. J., 87,122,485 
Mark sensing process, 322 
Mathematical probability; see Proba¬ 
bility 

McCandless, Boyd, 285 
McCarthy, Phillip J., 19,48, 445 
Mean; see Arithmetic mean 
Mean deviation, 74-75 
Measurement error, 286-87 
Medial average, 472,474 
Median 

computation of, 67-68 

as a decision-parameter, 67-71 

defined, 67-68 

of a population, 67-71 

properties of, 69-70 

of a sample, 203 

Mid-value of a class, 312, 314-19 
Mills, Frederick C., 49,331,485 
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Minimax estimator, 205-7 
Minimum inspection costs, 270 
Mitchell, Wesley C., 478-79 
Mode 

as a decision-parameter, 71 
of a population, 67, 71 
Moving average, 466-70 
Moving seasonal, 477 
Multi observational problems, 31-32 
Multiple sampling plan, 263-64, 266-68 
Multiplier or calculator, 323 
Multivariate population, 36, 402 
Mutually exclusive outcomes, 132-33 
Myers, Robert J., 294 

N 

National Bureau of Economic Research 
479-80 

Neiswanger, William A., 122, 331 
Neter, John, 49, 87, 122, 231, 274 
Newman, Joseph W., 106 
Neyman, J., 274 
No observations, 12, 88-90 
Nonparametric statistics, 83,438-41 
Nonresponse, 282-86 
Nonsampling errors, 275-95 
Nonsense correlation, 406-8 
Normal curve, 165, 500-501; see also 
Normal distribution 
Normal deviates, 165, 502 
Normal distribution 
estimation of population model, 225- 
28 

as population model, 42, 77-83, 225-28 
as sampling distribution in compara¬ 
tive experiments, 373-74 379-81 
384-85 

as sampling distribution of p, 160, 165— 
75 

as sampling distribution in regression 
problems, 419-27 

as sampling distribution of x, 179-84 
and standard deviation, 77-83 
table of normal deviates, 502 
Normal equations, 417 
Null hypothesis, 238 

O 

Observations 
biased, 286-91 
bivariate, 35-36 
of characteristics, 22-25 
continuous, 32-33 
discrete, 32 
multivariate, 36, 402 
numerical, 2 
qualitative, 32-33 
quantitative, 32-33 


Observations— Cont. 
in statistics, 10-12 
univariate, 35-36 

OC curve; see Operating characteristic 
curve 

One-tailed test, 242-43 
Open-ended question, 307-8 
Operating characteristic curve 
calculation of, 249-57 
in comparative experiments, 379 
defined, 248-49 

P 

p-chart, 336-44 
Panel technique, 301 
Parameter, 52; see also Decision- 
parameter 

Parten, Mildred, 295, 307, 331 
Pearson, Karl, 67, 76 
Personal interview, 298-99 
Planning, the principle of, 14-16, 96 
Point estimate, 197 
Polya, G., 153 
Population 
of attributes, 33-35 
bivariate, 35-36, 402 
defined, 20-21 
existent, 49 
finite, 29 

finite twofold, 170-75 
hypothetical, 49 
infinite, 29 
multivariate, 36, 402 
univariate, 35-36 
of variates, 33-35 
Population model 
bimodal, 44-45 

bivariate normal distribution as 
435 

defined, 41 
J-shaped, 43-44 
multimodal, 44 
normal curve as, 42 
skewed or asymmetrical, 42-44 
symmetrical, 41-42 
U-shaped, 44 
unimodal, 44 
uses of, 45-46 
Precision 

accuracy contrasted with, 276-77 
attained, 217-20 
meaning of, 203 
Prestige bias, 290 
Primary data, 90-96 
Primary study, 90-96,195-96 
Probability 
addition rule, 133 
binomial formula, 144-47 
conditional, 134-41 
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Probability —C ont. 
defined, 128-29 
distribution, 154 
estimation of, 129-31 
expected value and, 148-50 
independent trials, 135,141-44 
mathematical, 127-34 
measure, 128,132-34 
and measurement of risk, 140-41 
mutually exclusive outcomes, 132-33 
and the normal curve, 165-70 
theory,122-53 
Probability sampling 
advantages of, 103,107 
defined, 103 
types of, 

cluster sampling, 118 
double sampling. 118-19 
sequential, 119 
simple random. 107-10 
stratified, 116-18 
Procedural bias 
accuracy vs. precision, 276-77 
control of, 276-77 
defined, 275 
sources of 

arithmetical errors, 292 
auspices of study, 289-90 
built-in deficiency, 276-77 
data flow, bias in, 291-92 
improper formulation of problem, 
277-79 

interviewer bias, 288-89 
measurement errors, 286-87 
nonresponse, 282-86 
observations, biased, 286-91 
prestige bias, 290 
questionnaire, 287-88 
selection procedure, bias, 279-82 
Producer’s risk, 258 
Proportion 

in analytical studies, 57 
confidence interval for, 225 
as a decision-parameter, 53-58, 

236-41 

defined, 53-54 

in enumerative studies, 57 

estimator for, 199 

minimax estimator for, 205-6 

point estimate of, 199 

in a population, 50-51, 53-58 

in a sample, 125 

sampling distribution of, 156-64 
Punch card, 319-21 

Q 

Quality control, 333-34; see also Con¬ 
trol chart 


Questionnaire 
bias, 287-88 
described, 302 

fixed alternative questions, 307-8 
free-response or open-ended ques¬ 
tions, 307-8 
Quota sampling 
defined, 105 

stratified sampling contrasted with, 
117-18 

R 

J?-chart, 344, 348-49 
Rand Corporation, The, 112, 503 
Random digits, table of, 503-5 
Random errors, 286-87 
Random numbers, 111-16 
Random process, 103,107 
Random sampling; see Probability sam¬ 
pling 

Random variable 
defined, 124-25 
expected value of, 148-50 
statistic as a, 126-27 
Randomization, 375 
Randomized block design, 394-95 
Range 

of a population, 74 
of a sample, 346-47 
Range chart; see R -chart 
Rank correlation, 438-40 
Ratio estimate, 203-4 
Ratio-to-moving-average method, 471-75 
Recalls, 285, 298-300 
Reciprocals, tables of, 488-99 
Rectangular distribution, 81-82 
Regression 

curvilinear, 405 

judgment applications of, 428-32 
linear, 404 

problems, nature of, 403-6 
sampling procedure for, 413-14 
scatter diagram, 405-6 
simple linear-regression model, 408-11 
Regression line 

confidence interval estimates, 427-28, 
429 

least-squares method, fitted by, 417-18 
for a population, 411 
slope of 

defined, 411-12 
estimator for, 414 
standard error of, 419-20 
standard deviation of regression 
defined, 413 
estimator for, 418-19 
standard error of estimated 7-value 
estimator for, 423-26 
meaning of, 424 
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Regression line— Cont. 
standard error of line 
estimator for, 422-23, 425-26 
meaning of, 422 

work sheet for computation of, 416 
F-intercept 
defined, 411 
estimator for, 414 
Rejection number, 264-65 
Relative seasonal, 477 
Reproducer and summary punch, 323 
Rice, Stuart A., 289 
Risk 

in estimation problems, 194-231 
of incorrect decisions, 11-12 
measurement of, 140-41 
in testing problems, 232-74 
Roberts, Harry V,, 19, 87, 153, 193, 231, 
295, 400, 445 
Rosander, A. C., 193,230 

S 

Sample 

arithmetic mean, 124 
defined, 97 

as part of population, 23 
pattern of variability, 127 
proportion, 125 
selection, 88-122 

size, formulas for determining, 212, 
214, 216, 241,248 
standard deviation, 125 
variance, 125 

Samples, all possible, 100-102 
Sampling 
accuracy of, 98-99 
in analytical study, 29-30 
versus complete count, 99-100 
convenience, 102-7 
defined, 97 
economy of, 97, 98 
in enumerative study, 30 
infinite population, and, 99 
judgment, 102-7 
methods of 
cluster, 118, 209-10 
cost of, 207-10 
double, 118-19 
sequential, 119 
simple random, 107-10 
stratified, 116-18,208-9 
versus no observations, 99 
probability, 12, 102-7 
quota, 105 
reasons for, 96-100 
risk, measurement of, 140-41 
timeliness of, 98 


Sampling distribution 
arithmetic mean 
development of, 175-84 
expected value, 184-85 
normal curve, 179-84 
sample size, 179-84 
sampling risk, 186-88 
standard error, 185-86 
decision-parameter, 189-90 
defined, 154-56 

difference in sample means, 373 
difference in sample proportions, 379- 
80 

hypergeometric, 170 
proportion 

binomial formula, 156-57 
expected value of p, 161-62 
finite population, 170—75 
infinite population, 156-64 
normal distribution, 160 
relation to population propor¬ 
tion, 157-58 

relation to sample size, 158-60 
sampling risk, 167-69,172-74 
standard error, 162-64 
slope of regression line, 420-21 
Sampling error versus nonsam'pling er¬ 
ror, 275-77; see also Procedural bias 
Savage, Leonard J., 19 
Scatter diagram, 405-6, 415-17 
Schedule, 298, 302 
Schweitzer, Henry, 94 
Seasonal movement 
absolute, 477 
adjustment for, 476-77 
constant, 477 
moving, 477 

ratio-to-moving-average method, 471- 
75 

relative, 477 
Secondary data 
defined, 90 
external, 92-96 
internal, 91-92 
principle of planning, 96 
reliability of, 96 
uses of, 91-96 
Secondary studies 
estimation problems, 196-97 
in general, 90-96 

operating characteristic curve, 256 
Sequential sampling, 119, 263, 265-68 
Shewhart, Walter A., 333 
Shubin, John A., 195 
Simmons, Willard A., 208 
Simple linear-regression model, 408-11 
Simple random sampling 
analytical studies, 110 
characteristics of, 108-9 
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Simple random sampling— Cont. 
defined, 108 

enumerative studies, 108-9 
from finite population, 108-9 
from infinite population, 110 
Single sampling plan, 263-68 
Slope 

defined, 411-12 
estimator for, 414 

standard error of estimate of. 419-20 
Slutzky-Yule effect, 469-70 
Smith, B. Babington, 112 
Sorter and sorter counter, 322 
Specification limits, 361-63 
Sprowls, R. Clay, 19,48, 274, 445, 486 
Spurious correlation, 406-8 
Square roots, tables of, 488-99 
Squares, tables of, 488-99 
Standard deviation 

computation from frequency distribu¬ 
tion, 316-19 

as a decision-parameter, 75-77 
defined, 76-77 

and exponential curve, 81-82 
formula for, 76-77 
interpretation of, 77-83 
and the normal curve, 77-83 
of a population, 75-83 
and rectangular distribution, 81-82 
of a sample, 125 

Standard deviation of regression 
defined, 413 
estimate of, 418-19 
Standard error 

of difference in sample means, 373-74, 
377-78 

of difference in sample proportions, 
380-81 

of estimate; see Standard deviation 
of regression 

of estimated regression line, 422-23, 
425-26 

of estimated F-value, 423-25, 426 
of mean 

finite population, 185-86 
infinite population, 187 
and population size, 189 
population standard deviation, 189 
sample size, 189 
of proportion, 162-64 
finite population, 171-72 
infinite population, 163 
and 7T, 164 

and population size, 174-75 
and sample size, 163-64 
of slope of regression line, 419-20 
States of the world, 6-10; see also Deci¬ 
sion-parameter 


Statistic 

decision-parameter contrasted, 125-26 
defined, 124 

as a random variable, 126-27 
Statistical control, 337-341 
and satisfactory product, 360-63 
Statistical decision rule; see Decision 
rule 

Statistical decisions by association, 401- 
45 

Statistical design of comparative experi¬ 
ments, 369-400 
Statistical experiment 

allocation of sample, 374-75 t 
defined, 369 
design of, 371-72 
multifactor, 392 
randomization, 375 

Statistical population; see Population 
Statistical table 

column heading, 312 
footnote, 313 
format, 310-11, 312-13 
source, 313 
stub, 313 
title, 312 
uses of, 309 
Statistics 
as a field, 2-3 

historical notes, 1, 62, 67, 76, 78, 83,263, 
333, 417 

as numerical data, 2-3 
as plural of statistic, 123-27 
Stock, J. Stevens, 290 
Stockton, John R., 121 
Stouffer, Samuel A., 67 
Stratified sampling- 
defined, 116 

quota sampling contrasted, 117-18 
Suchman, Edward A., 285 
Systematic errors, 287 

T 

t, tables of, 509 
Table; see Statistical table 
Tabulator, 322 
Target population, 27-28 
Tchebycheff, Pafnutti Lvovitch, 83 
Tchebycheff’s inequality, 82-83 ’ 
Teamwork, principle of, 16-17 
Telephone interview, 300 
Testing problem 

in acceptance sampling, 257-68 

alpha risk in, 235 

beta risk in, 235 

decision rule, elements of, 238 

defined, 12-14 

error of first kind, 235 

error of second kind, 235 
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Testing problem— Cont. 
one-tailed, 242-43 
two-tailed, 242-43 
type I error, 235 
type II error, 235 
3-sigma limits, 339, 355 
Ties in ranks, 439 
Time series 

absolute changes in, 450-51 
cyclical fluctuations, 454-55, 478-80 
defined, 447 
and forecasting, 446-86 
graphic presentation of, 449-52 
moving average, 466-70 
random movements, 456 
relative changes in, 450-51 
seasonal movements, 455-56,470-78 
serial correlations, 449 
Slutzky-Yule effect, 469-70 
trend element, 453, 458-66 
Tippett, L. H. C., 112,399 
Tolerable error, 210-11 
Torray, Philip, 94 
Trend 
defined, 453 
fitting of, 460-64 
linear, 461-63 
nonlinear, 461-62 
projection of, 464-66 
statistical test for, 458-61 
Two-tailed test, 242-43 
Type I error, 235 
Type II error, 235 

U 

VCL , 337 

Unassignable causes, 336 
Unbiased estimator, 201-4 
Uncertainty 

in estimation problems, 194-231 
meaning of, 3,10-12 
in testing problems, 232-74 
Uncontrollable causes, 336 
Unequal class intervals, 311 
United Nations Subcommission on Sta¬ 
tistical Sampling, 331 
Universe, 21; see also Population 
Upper control limit ( UCL ), 337 


V 

Vance, Lawrence L., 274 
Variability 

as a criterion for decision, 72-73 
measures of 

average deviation, 74-75 
range, 74 

standard deviation, 75-83 
variance, 75-83 
in population, 52, 71-83 
in a process 

assignable-unassignable causes, 336 
controllable-uncontrollable causes, 
336 

and decision making, 336-37 
special-common causes, 336 
Variance 

as a decision-parameter, 75-83 
defined, 76-77 

estimator for, 201-2, 218-19 
formula for, 76-77 
interpretation of, 77 
population, 75-83 
sample, 125 
Variate, 32-36 
Verifier, 322 

W 

Wald, Abraham, 263 
Walker, Helen M., 67,87 
Wallis, W. Allen, 19, 87, 153, 193, 231, 295, 
307, 400, 445 

Wasserman, William, 49,87,122,231 
Welch, Emmett H., 288 
Working population, 27-28 

X 

5-chart, 344-48 

Y 

F-intercept 
defined, 411 
estimator of, 414 
Yates, Frank, 48,104, 331 
Yule, G. Udny, 49 

Z 

165,502 
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